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The  regularity  observed  in  natural  processes  can  often  be  expressed 
and  explained  in  mathematical  language.  Galileo  put  forth  this  idea 
in  stating  that  the  language  of  science  is  mathematics.  In  many 
scientific  fields  mathematical  descriptions  have  provided  the  tools 
for  fundamental  advances  and  important  discoveries.  These  conferences, 
which  are  sponsored  by  the  Army  Mathematics  Steering  Committee  (AMSC), 
provide  an  opportunity  for  engineers,  physicists  and  other  scientists 
to  get  together  with  mathematicians  to  originate  conceptual  and 
analytic  tools  to  treat  problems  in  these  various  fields. 

As  in  previous  meetings,  the  Twenty-Third  Conference  of  Army  Mathe- 
maticians gave  its  attendees  a chance  to  see  the  developments  taking 
place  in  the  various  Army  laboratories.  The  Army  scientist's  contrib- 
utions covered  a broad  spectrum  of  scientific  areas.  Through  these 
meetings,  techniques  developed  at  one  installation  are  brought  to  the 
attention  of  scientists  at  other  places,  thus  reducing  duplication  of 
effort.  Another  important  phase  of  these  meetings  is  presenting  the 
members  of  the  audience  an  opportunity  to  hear  nationally  known 
scientists  discuss  recent  developments  in  their  own  fields.  This  year 
there  were  five  invited  speakers  who  gave  addresses  in  the  areas  listed 
below,  and  who  were  more  than  willing  to  discuss  various  problems  with 
scientists  in  the  Army  agencies. 

Speaker  and  Institution 

Prof.  M.D.  Kruskal 
Princeton  University 

Prof.  D.H.  Sattinger 
University  of  Minnesota 

Prof.  Mike  Crandall 
Mathematics  Research  Center 

Prof.  H.O.  Kreiss 
Uppsala  University,  Sweden 
(Visiting  NYU) 

Prof.  Edward  Kamen 
Georgia  Institute  of 
Technology 
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What's  All  This  About  Soli tons 


Group  Theoretic  Methods  in 
Bifurcation  Theory 

Evolution  Governed  by  Accretive 
Operators 

Numerical  Solution  of  Problems 
with  Different  Time  Scales 


Use  of  Algebraic  Methods  in  the 
Design  of  Controllers  and  Observers 
for  Systems  with  Time  Delays 
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The  success  of  the  the  benefits  derived  from  these  conferences 
depend  a great  deal  on  the  host  installation.  This  year  we  were 
pleased  to  have  the  U.S.  Army  Air  Mobility  Research  and  Development 
Laboratory,  Langley  Directorate,  Hampton,  Virginia,  serve  in  this 
capacity.  Those  in  attendance  were  fortunate  to  have  Mr.  Robert  L.  Tomaine 
as  Chairman  on  Local  Arrangements.  He,  together  with  members  of  his 
staff,  provided  all  those  things,  such  as  projection  equipment,  travel 
information,  etc.,  needed  for  a successful  symposium. 

The  AMSC  is  pleased  to  be  able  to  provide  the  Proceedings  of  this  Con- 
ference. It  is  hoped  the  scientific  ideas  contained  herein  will  benefit, 
not  only  those  who  were  able  to  attend  the  conference,  but  also  many 
others  that  did  not  enjoy  that  privilege. 
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NONNEUTRAL  PLASMA  EXPANSION 


K.E.  Lonngren* 
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University  of  Wisconsin 
Madison,  Wisconsin 


ABSTRACT 


The  self  similar  expansion  of  nonneutral  plasmas  is  examined  using  i 

various  models.  Analytical  solutions  are  obtained  for:  (1)  The  | 

expansion  of  a Maxwellian  electron  cloud  governed  by  Ohm's  Law;  , 

(2)  The  expansion  of  an  electron  cloud  using  a mobility  model;  | 

(3)  The  expansion  of  an  electron  cloud  using  a cold  plasma  model;  | 

and  (4)  The  expansion  of  a charge  particle  cloud  with  a temporally  i 

decaying  nonlinear  diffusion  coefficient.  ] 

1 

AMS(MOS)  Subject  Classification:  78.35 

Key  Words:  Self  similar  solution  of  nonneutral  plasma  expansion 
Work  Unit  Number  3 - Applications  of  Mathematics 


* On  leave  from  the  University  of  Iowa. 

Sponsored  by 

the  United  States  Army  under  Contract  No.  DAAG29-75-C-0024  and 
the  National  Science  Foundation  Grant  #ENG  76-15645. 


i 


1 


1 


NONMCUTRAL  PLASMA  EXPANSION 


K.  E,  Lonhgren 

In  A previous  report}^  we  presented  • methodical  procedure  for  obtaining  the  self 
sisdlar  variables  using  some  of  the  techniques  of  Lie  Group  Theory.  General  physical 
phenomena  which  occur  in  plasmas  and  could  be  modeled  with  equations  amenable  to  a self- 
similar treatamnt  were  presented  along  with  their  solutions.  An  extensive  list  of  ref- 
erences was  also  given.  The  purpose  of  this  report  is  to  sumnarize  a pot-pourri  of 
further  examples  which  fall  under  the  umbrage  of  "Nonneutral  Plassia  Expansion". 

With  the  increased  interest  in  relativistic  electron  beam  fusion  devices,  consider- 


able attention  has  been  given  to  the  study  of  nonneutral  plasmas.  The  recent  monograph 
(2) 

by  Davidson  summarizes  the  past  work  and  motivates  our  interest  in  exeunining  various 
aspects  of  this  problem.  In  this  report,  we  shall  examine  three  sets  of  fluid  equation 
models  and  using  the  technique  of  "Seif  Similar  Solution  of  Partial  Differential  Equations, 


obtain  analytical  solutions. 


In  addition,  a recent  experiment  on  the  Wisconsin  Multipole  suggests  that  the  cross- 
field diffusion  coefficient  can  be  modeled  with  a one-dimensional  diffusion  equation 
where  the  diffusion  coefficient  is  nonlinear  and  exponentially  decaying  in  time.^^^  A 


self-similar  solution  of  this  problem  shall  also  be  given. 


In  Section  II,  we  describe  the  various  models  and  present  the  self-similar  solutions. 


The  models  are:  (1)  The  expansion  of  a Maxwellian  electron  cloud  governed  by  Ohm’s  Law; 
(2)  The  expansion  of  an  electron  cloud  using  a mobility  model;  (3)  The  expansion  of  an 
electron  cloud  using  a cold  plasma  model;  and  (4)  The  expansion  of  a charge  particle 
cloud  with  a temporally  decaying  nonlinear  diffusion  coefficient.  Section  III  is  the. 
conclusion. 


II.  Various  Problems 


To  analyze  the  problems,  we  follow  the  procedure  given  in  reference  1 and  only 

describe  the  physical  phenomena,  list  the  PDE,  the  self  similar  variables,  the  ODE  and 

the  solution  to  the  ODE  without  repeating  the  details  of  the  procedure  in  each  case. 

(41 

1)  Expansion  of  a Maxwellian  electron  cloud  governed  by  Ohm's  Law 
In  this  example,  we  examine  the  expansion  into  a vacuum  of  a thermalized  electron 
cloud,  described  by  an  isothermal  Maxwellian  distribution.  It  is  assumed  that  collisions 
are  sufficiently  frequent  such  that  we  can  speetk  of  a conductivity  for  the  medium. 

The  governing  equations  are: 

I)  Equation  of  continuity 

(1) 


II)  Ohm's  Law 


III)  Maxwellian  electrons 


ix  Pt  = 0 


j ■ and 

X 


P - "gq  expEq^AgT^l 


(2) 


(3) 


where  all  symbols  are  standard.  By  differentiating  (3)  with  respect  to  x and  substitut- 
ing (2) , we  obtain 


a ..  B e 1 

j • -o — p 

q p X 


(4) 


In  normalized  units,  (4)  and  (1)  lead  to 


(i  - 0 


(5) 


y y 


where 


♦ - O/n^q,  y - T - t/(E|j/o) 

and  X.  is  the  Debye  length  and  C /a  Is  a relaxation  time. 

D 0 

Ne  find  that  (S>  admits  a self-similar  solution  of  the  form 

n(C)  • T\|((y,T),  C - y/T  . (6) 

The  solution  satisfies  the  conservation  law  that  electron  charge  is  conserved  in  space. 
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The  boundary  conditions  which  are  germane  to  this  problem  are: 

a)  i|((O.T)  - ii^/T  • n(5:-o)  - l|»jj 

b)  l|>(-,T)  - 0 - l|»(y,0)  • n(C-“)  - 0 i.e.  "consolidation",  (7) 


Substituting  (6)  into  (5) , %«e  obtain 


The  first  Integral  of  (B)  is 


(8) 


-en  ^ Cj  - i . (9) 

We  shall  further  is^se  the  condition  that  the  current  j ■"  0 as  y -►  •>>  . From  (4)  , 

this  transforms  to  i 0 as  ^ » . Using  this  and  (7b)  , - 0 . 

The  integral  of  (9)  is 


C /2  + C, 


(10) 


The  constant  is  determined  from  (7a)  to  be  « l/l))^  . In  terms  of  y and  T , the 

final  result  is 


"o^  TlyV2T^  + l/4ijjl 


(11) 


Charge  is  conserved  as  is  shown  below. 


/ dy  « -i—  / — - ^ y ■ const.  (12) 

0 "o'*  "o'*  0 (5^/2  + l/K-^l  "o'*  ^ 


In  conclusion,  we  have  examined  the  expansion  of  an  electron  cloud  in  a vacuum. 
Under  conditions  where  Coulomb  forces  can  be  neglected,  this  calculation  could  model  an 
electron  cloud  e:q>ansion  in  a plasma  in  time  scales  short  with  respect  to  ion  motion. 

2)  Expansion  of  an  electron  cloud  using  a mobility  model 

Recently,  considerable  attention  has  iseen  given  to  the  problem  of  the  transient  be- 
havior of  the  bul)c  electric  field  and  space  charge  distribution  in  semi-conductors  and  in 
the  conduction  in  dielectric  and  insulating  fluids.  It  has  been  found  prudent  to  use  a 
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■obility  Bodel  whare  tha  valocitias  of  charga  carrlars  injactad  froai  an  aalttlng  elactroda 
ara  proportional  to  tha  alectric  fiald  through  thair  Bobilities  and  tha  alactric  fiald  ia 
related  to  the  charge  densities  of  the  carriers  through  Gauss's  law.  Many  and  Rakavy 
and  Helfrich  and  Mark^^^  were  probably  the  first  to  suggest  that  tha  problem  could  be 
■odeled  with  the  set  of  dimensionless  equations 

B - p 
X 

ijj  + - 0 (13) 

i - ce 

which  are  Poisson's  equation,  the  equation  of  continuity  and  a mobility  definition  for 
current  respectively.  The  subscript  x and  t denote  a partial  differentiation  with 
respect  to  space  and  time. 

In  their  original  paper.  Many  and  Rakavy*®'  obtained  a solution  to  (13)  by  looking 
for  the  characteristics  of  the  problem.  Subsequently,  this  approach  was  extended  by  Batra, 
Schechtman  and  Seki,^®^  Zahn,  Tsang  emd  Pao,  de  Oliveira  and  Ferreira^^*^*  2md  others. 

An  extensive  list  of  relevant  experimental  observations  is  given  in  reference  9. 

As  the  problem  is  extremely  iiqportant,  we  suggest  an  alternative  technique  for  solu- 
tion which  will  describe  the  spatial  and  teaqioral  evolution  of:  Da  fixed  electric  field 
and  II)  a constant  source  of  current  which  are  both  governed  by  (13) . The  technique  that 
wa  shall  apply  is  to  find  the  "self-similar  solution”  of  this  set  of  partial  differential 
aquations. 

Equation  (13)  can  be  written  as 


“xx  ♦ «X>  * “xt  ■ ® 


(14) 


Tha  salf  similar  variables  ara 


♦ - 


.o/Y 


and  C * 


.6/Y 


(15) 


idiara  a/y  and  6/Y  A'*  constants  which  will  be  specified  by  invariance  and  conserva- 
tion raquiraswnts.  Invariance  specifies  that  a - 6 • -y 
Substituting  (IS)  in  (14),  wa  write 
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<♦  - ® ‘ Y " y^^5  * ° 

W«  shall  obtain  solutions  for  (16)  subject  tot  I)  A fixed  electric  field  and  ZI)  a con- 
stant current  at  x • 0 requirement. 

I)  Electric  field  is  constant  at  x • 0 . He  choose: 

a - 0 

e - Y • 

Mlth  these  values,  C ~ x/t  and  4 ■■  B • n>e  integral  of  (16)  with  these  constants  is 

44^  - Wj  - )tj  (17) 

where  )(^  is  a constant  of  integration.  The  constant  is  set  equal  to  zero  since 
E(x  • 0,t)  >0  in  order  to  satisfy  space  charge  limited  conditions.  This  specifies 
^(E  ~ 0)  • 0 . The  solution  of  (17)  is  ^ ~ C from  which  we  confute  that 


E 


i 


(t^tg)" 


(18) 


where  the  constants  Xg  and  tg  have  been  expllcitely  included  since  (13)  is  invariant 
to  translation. 

ZI)  Current  is  constant  at  x • 0 . He  choose: 


2a  6 

Y * Y ■ 

Hlth  this  choice,  %«e  have  C ^ ^ where  ^ satisfies  (16)  tihich  becosies 

(«  - 25)*^^  - ♦ (♦^)^  - 0 . (20) 

This  can  be  integrated  once  to 


i 

I 


J 
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«♦  - 2C)*^  t * - ICj  . (21) 

The  constant  of  Integration  is  set  equal  to  zero  since  we  require  that  E(x  » O.t) 

be  zero  for  space  charge  limited  conditions  which  specifies  ~ 0)  ~ 0 . The  integral 

of  (21)  is 


from  which  we  compute  that 


E “ t+t„ 


(tttQ) 

where  again  the  constants  x.  and  t„  have  been  introduced  since  (13)  is  invariant  to 

0 0 

translation.  Note  that  this  is  identical  to  (18) . 

In  (21),  we  can  also  obtain  the  solution  for  a non-space  charge  limited  condition 
(B(x  “ 0,t)  ^ 0)  by  setting  the  constant  of  integration  equal  to,  say,  21^  . The 


integral  of  (21)  can  then  be  written  as 


from  which  we  confute  that 


? > in 


..C..r,)(io>^ 


(t+to) 


2 "^0 


where  again  the  constants  x^^  and  . t^^  have  been  reintroduced.  Note  that  (25)  reduces 
to  the  apace  charge  limited  case  for  i^  - 0 . 

In  conclusion,  we  have  sho%m  that  the  set  of  equations  which  describe  the  Transient 
Space  Charge  Limited  Current  Problem  admit  self-similar  solutions  for  two  physically  in- 


teresting boundary  conditions.  These  solutions  are  valid  in  the  initial  stages  before 
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the  particles  reach  a second  electrode  which  nay  be  placed  at  x • L . 


I 


3.  Expansion  of  an  electron  cloud  using  a cold  plasaia  model 

A aodel  which  can  describe  the  behavior  of  an  electron  cloud  expansion  into  a fixed 
ion  background  is  to  assume  that  Poisson's  equation  is  an  initial  condition.  The  electro- 
static approximation  for  the  V X B Maxwell  equation  and  the  continuity  equation  assure 
that  Poisson's  eqxiation  is  satisfied  for  all  time.^^^^ 

The  basic  equations  are: 

- 0 


+ Bwv 

t X 


-eE 


(26) 


V X B ' 0 - EjjE^  - pv 

which  are  the  equations  of  continuity  and  motion  and  Maxwell's  equation  respectively. 

OD 

The  self  similar  variables  which  Satisfy  the  conservation  law  that  / ndx  « constant 

0 


E;  N = nt 


v/t;  i - x/t  . 


(27) 


These  are  the  same  self  similar  variables  that  were  obtained  in  an  earlier  study  of  the  set 

(26)  where  the  Ansatz  that  Maxwell's  equation  could  replace  Poisson's  equation  had  not 
(12) 

tMen  made.  In  the  earlier  study,  it  was  not  a pedestrian  task  to  integrate  the  ODE. 

Substituting  (27)  into  (26),  we  now  obtain  the  ODE: 

0 

0 - 2CU^  + UU^  - -e  (28) 

+ NU  - 0 . 


-2N  - 2CN^  + (NU)^ 


A solution'  of  this  set  is 

U>2C,  N-2  and  e>  -2^ 

from  which  we  write  the  solution  of  (26)  using  (27)  and  (29)  as 


_ 2x  2 2x 


(29) 


(30) 


In  conclusion,  we  find  that  in  the  final  self  siavilar  solution,  the  density  is 
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Independent  of  position  at  the  end  rather  than  making  it  an  a priori  assua^tion  in  the 
calculation  as  did  Glntsburg  who  treated  a similar  problem. 

4.  Expansion  off  charged  particles  with  a temporally  decaying  nonlinear  diffusion 
coefficient 

In  recent  experiments  on  the  Wisconsin  Multipole,  it  was  confirmed  that  the  cross- 
field diffusion  coefficient  depended  on  time  and  amplitude  as^^^ 

D ~ . (31) 

Incorporating  this  in  the  one  dimensional  diffusion  equation,  we  obtain 


• ['■"  7=  "«] 


where  all  constants  except  a have  been  suitably  normalized  away.  A change  of  variables 

T - i (1  - e"®^]  (33) 

a 


transforms  (32)  to 


The  self  siidlar  treatment  of  (34)  is  straightforward,  at  least  for  the  case  where 

a» 

the  conservation  law  / ndx  • constant  is  valid.  The  self  similar  variables  are 
0 

M - nT^''^  and  e - x/T^^^  (35) 


and  the  resulting  ODE  la 


4<»>e 


[A  "']e  • 


If  tlte  burst  of  particles  is  symswtric  at  x • 0 such  that  n I ~ 0,  and 

*'x-0 

-2/3 

ii(ji^«T)  • T ' (36)  can  ba  integrated  twice  to  yield 


n — 


2 ♦ 5 /3 
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Using  (33)  and  (35)  in  (37) , we  finally  obtain 


n(x,t) 


4[i(l-e"®’')J^ 


(38) 


He  note  that  (38)  gives  a reasonably  accurate  qualitative  description  to  the  ex- 
perimental results.  This  seems  true  even  though  the  time  scale  in  the  experiment  is 
long  such  that  normal  modes  have  been  exited. 

III.  Conclusion 

The  self  similar  behavior  of  four  plasma  phenomena  have  been  described. 
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THE  MATHEMATICAL  DESCRIPTION  OF  A MOVING  BOUNDARY 
PROBLEM  IN  AN  ELLIPTIC  - PARABOLIC  SYSTEM  OF 
PARTIAL  DIFFERENTIAL  EQUATION  IN  THE 
HYDRODYNAMICS  OF  POROUS  MEDIA 

Yoshisuke  Nakano 
U.S.  Army  Cold  Regions 
Research  and  Engineering  Laboratory 
Hanover , New  Hampshire 


ABSTRACT.  The  simultaneous  solution  of  two  tjTpes  of  partial 
cifferenxial  equations,  a parabolic  equation  for  vmseturated  flow  and  an 
elliptic  equation  for  saturated  flow  is  required  for  analysis  of  water 
movement  in  partly  unsaturated  and  partly  saturated  porous  media.  A new 
end  complete  mathematical  description  of  the  boundary  is  obtained.  It 
is  proven  that  the  boundary  is  generally  a singular  surface  and  the 
existing  theory,  which  neglects  such  singularity  is  incorrect. 

I.  li.'THOD'uCTION.  The  analysis  of  water  movement  in  a partly 
unsaturated  and  partly  saturated  porous  medium  requires  the  simultaneous 
solution  of  two  t;.'pes  of  partial  differential  equations;  a parabolic 
equation  for  the  unsaturated  part  and  an  elliptic  equation  for  the 
saturated  part.  Raats  (l9T2,  cf.  Raats  and  Gardner,  197*^)  studied  the 
Douncao'  condition  between  these  two  parts  and  found  that  no  condition 
necessarily  had  to  be  imposed  on  the  boundary.  This  finding  has  been 
accepted  as  the  complete  mathematical  description  of  the  boundary. 

However,  in  the  present  work  it  is  proven  that  the  boundary  condition 
derived  by  Raats  (1972),  although  correct,  is  not  complete,  because  it 
does  not  account  for  the  discontinuity  of  a certain  physical  variable. 

A new  and  complete  mathematical  description  accounting  for  this  dis- 
continuity is  derived.  It  is  also  proven  that  a similar  boundary 

condition  holds  true  for  a wetting  front.  ] 

II.  THEORY.  Consider  a material  volume  V intersected  by  a moving 

bo'undary  o with  a velocity  vector  p.  The  material  volume  V consists  of 

a saturated  part  V and  an  unsaturated  uart  V as  shown  in  Figure  1.  V 

is  bounded  bv  surf aces  S and  c while  V is  oounded  by  surfaces  S 

_ E s u •'  u : 
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The  signs  of  iinit  no>-T5al  vectors  ji  on  these  surfaces  are  indicated 
■fay  superscripts  in  Figure  1.  V.'e  follov  the  standard  niethod  of  deriva- 
tion used  by  Paats  and  Gardner  (197^).  For  the  sake  of  simplicity 
ve  assure  that  the  nedium  neither  exhibits  the  capillary  fringe  phenomenon 
nor  ccni.ains  either  a source  or  a sink..  Vc  also  assume  that  all  variables 
are  siiiooth , in  other  vends,  all  variables  including  their  derivatives 
of  all  orders  are  continuous  in  V-a. 

Atplying  the  FeyroZis'  transport  t;;errem  to  the  tvo  volumes  V 
J^nd  by  the  use  of  a Cartisian  coordinate  system  (x,  y,  z)  attacned 
to  the  solid  phase  of  the  medium,  ve  get 


^ / c-dV  = / 9v  dV  + r eu  dS 

I>t  7 ^ n J n 

s 6* 

s 

^ / edv  = / ly  dv  + / ev  ds  + / 6u  c 
Dt  F,  ot  / n in 


= volumetric  vater  content.  In  V C-9  , vhere  9 is  t'je  sat- 

s s s 

'lU'ated  voltr;etri c vater  content. 

= flux  of  vater,  vhere  v is  the  velocity  vector  of  vater  relative 
to  the  solid  phase. 

= 9(v-p)  = component  of  the  flux  in  the  direction  of  ji. 

= U’  n = comnonent  of  the  velocitv  vector  u in  the  direction  of 

'\t  "K/  *\l 

fi.  The  Tiositive  direction  of  p on  o is  defined  as  outvard  from 
the  ratv.raied  part. 

= material  deri  .'atia'c . 

: Fev.-jolds'  transport  theorem  is  essentially  a math um at i cal 
- te-veen  time  derivai ic  es  in  ue.-ms  of  tv.*c  different  space 
j: e systems  (Fringen,  IpoT). 
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CRITICAL  REVIEW  OF  ONE-DIMENSIONAL  TUBE  FLOW 
EQUATIONS 


Aivars  K.R.  Celmigs 
Ballistic  Modeling  Division 
Ballistic  Research  Laboratory,  USARRADCOM 
Aberdeen  Proving  Ground,  Maryland  21005 

ABSTRACT 

Flows  through  ducts  or  pipes  are  often  analyzed  theoretically  and  numer- 
ically using  one-dimensional  flow  equations.  Generally  it  is  assumed 
that  the  equations  describe  relations  between  average  flow  properties  and 
that  they  are  adequate  if  the  axial  component  of  the  flow  dominates.  This 
paper  reviews  the  derivation  of  the  governing  equations.  It  is  shown  that 
equations  which  are  traditionally  used  for  tube  flows  have  a very  limited 
scope  of  applicability.  Their  theoretical  validity  is  in  essence  restricted 
to  steady  incompressible  flows.  In  cases  of  more  complicated  flows  some 
terms  in  the  traditional  momentum  and  energy  equations  can  be  in  error  by  up 
to  50%.  It  is  also  shown  that  the  popular  approximation  of  the  energy 
dissipation  function  by  the  product  of  the  average  velocity  and  average  shear 
stress  is  appropriate  for  the  simplest  flows  only.  The  paper  reveals  short- 
comings of  traditional  methods  of  derivations  of  tube  flow  equations  and 
provides  explicit  formulas  for  correction  terms  which  should  be  used  in  the 
governing  equations.  An  interesting  property  of  the  new  equations  for 
average  flow  properties  is  that  the  momentum  equation  and  energy  equation 
cannot  be  combined  with  the  continuity  equation  to  yield  simple  equations 
for  velocity  components  and  specific  energy,  respectively.  Consequently  a 
divergence  form  of  the  equations  can  be  obtained  only  if  momentum  components 
and  energy  per  unit  volume  are  used  as  unknowns. 
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1 . INTRODUCTION 


We  consider  in  this  paper  the  derivation  of  governing  equations 
for  fluid  flows  through  ducts.  Such  flows  are  important  elements  in 
many  mechanical  systems.  Most  fluid  mechanics  textbooks  present, 
therefore,  a simple  derivation  of  the  governing  equations,  which 
reduces  the  general  three-dimensional  equations  to  a set  of  equations 
for  one-dimensional  flow.  Experience  has  shown  that  these  equations 
are  adequate  for  many  applications.  Probably  because  of  this  success 
researchers  sometimes  tend  to  disregard  the  limits  of  applicability 
of  the  one-dimensional  flow  equations.  In  order  to  derive  governing 
equations  for  more  complicated  flows  they  duplicate  the  steps  used  for 
simple  duct  flows.  The  resulting  equations  are  not  always  adequate, 
e.g.,  in  case  of  certain  non-steady  flows.  Some  textbooks  discuss 
limitations  of  the  usual  tube  flow  equations.  Often,  however,  the 
discussion  is  rather  general,  or  limited  to  examples  and  exercise 
problems,  and  easily  overlooked  by  casual  readers.  In  this  paper  we 
will  concentrate  on  the  limitations.  We  will  keep  the  discussions 
simple  by  considering  in  detail  only  a one-phase  flow  in  a straight 
duct  with  a constant  cross-section.  The  discussion  of  the  example  will 
provide  a methodical  approach  to  the  derivation  of  flow  equations  for 
more  general  cases. 

The  starting  point  of  our  discussion  is  the  set  of  general  three- 
dimensional  flow  equations.  In  order  to  make  this  paper  self-contained, 
we -list  the  equations  in  Section  2.  In  Section  3 we  specialize  the 
equations  for  the  case  of  a duct  flow  using  a standard  procedure,  which 
is  found  in  textbooks.  In  order  to  establish  limits  for  the  validity 
of  the  specialized  equations,  we  carry  out  in  Section  4' a more  careful 
derivation  of  the  duct  flow  equations.  This  derivation  provides  quan- 
titative information  about  the  errors  which  are  introduced  by  the  special- 
ization of  the  equations.  A comparison  of  the  derivations  and  results 
of  Sections  3 and  4 reveals  that  in  standard  derivations  of  the  equations 
some  non-zero  terms  are  neglected.  In  Section  S two  examples  are  pre- 
sented: a steady  flow  and  an  approximation  to  an  interior  ballistics 
flow.  Quantitative  estimates  are  given  for  some  usually  neglected  terms 
in  the  governing  equations.  Section  6 contains  some  conclusions  which 
can  be  drawn  from  the  discussions  of  the  equations. 


2.  BASIC  GOVERNING  EQUATIONS 


We  consider  flows  which  satisfy  conservation  laws  for  mass,  momen- 
tum and  energy.  Governing  equations  for  such  flows  are  derived  and 


discussed,  e.g.,  by  Tsien  in  Reference  1 and  Batchelor  in  Reference  2. 

In  this  section  we  summarize  the  equations  in  order  to  make  this  paper 
self-contained.  We  use,  in  general,  the  same  notation  as  Tsien,  includ- 
ing the  convention  about  the  summation  over  equal  indexes. 

First  we  will  consider  the  equations  in  integral  form.  In  these 
equations  the  volume  integrals  are  for  an  arbitrary  control  volume  V, 
vdiich  need  not  be  simply  connected.  We  assume,  however,  for  simplicity 
that  its  surface  S has  everywhere  an  outward  pointing  normal  nj . The 
conservation  of  mass  can  then  be  expressed  by  the  equation 


1^  / p dV  + ^pu^n^dS  = 0 


J J 


(2.1) 


The  momentum  equations  are 

/ pUjjdV  + /pUj^u^n^dS  + / dV  = / Fj^dV.  (2.2) 


The  specific  kinetic  energy  of  the  fluid  is 


k 


iujUj. 


(2.3) 


Combining  eqs.  (2.1)  and  (2.2)  we  obtain  for  the  kinetic  energy  the 
equation 


^ Jp  h dV  +^p  k 


.|2-dV 
.3x. 


u.F.dV  . 
1 3 


(2.4) 


Taien,  "The  Equations  of  Gas  Dynamics, " in  Fundamentals  of  Gas 
Dynamics,  edited  by  E.W.  Ermons,  Princeton  University  Press,  1958. 


^G.K.  Batchelor,  An  Introduction  to  Fluid  Dynamics,  Cambridge  University 
Press,  1967. 


The  first  law  of  thermodynamics  is 


y*p  e dV+^p  e UjHjdS+J* p dV  * • (2.5) 

)y  adding  eqs.  (2.4)  and  (2.5)  we  obtain  an  equation  for  the  specific 
:otai  internal  energy  e + k: 

J P(e  + k)dV-f^ip(e  + k)u^njdS+^p  u^n^dS  dV +^*((1)+  UjFj)dV. 

(2.6) 

rhe  last  integral  in  eq.  (2.6)  is  the  contribution  of  external  and 
(dscous  forces  to  the  changes  of  the  total  internal  energy.  Its  first 
part,  J^V,  is  the  contribution  of  viscous  forces  to  the  internal  energy 
e.  The  integrand $ , i.e.,  the  heat  dissipation  function,  can  be  ex- 
pressed in  terms  of  the  viscous  stress  tensor  Tji^: 


3u. 

* "kj  n- 


(2.7) 


We  assume  that  the  viscous  stress  tensor  is  related  by  Stokes  for- 
mula to  the  strain  rate  tensor  ej^j  (Reference  1,  page  13,  Reference  3, 
page  132) 


A 

T.  . = 2u  E.  . ♦ fw'  - 4tll  5.  .E.. 
kj  kj  y ■'  kj  11 


(2.8) 


This  definition  is  not  restricted  to  constant  viscosities  y and  y',  i.e., 
to  homogenous  fluids.  However,  it  restricts  the  considerations  to  iso- 
tropic fluids.  The  viscosities  y and  y’  must  be  positive  or  zero. 


^G.  Eamel,  Meohanik  der  Kontinua,  R.G.  TeidmeVf  Stuttgart,  1956. 
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SubstitutinK  (2.8)  and  (2.9)  into  eq.  (2.7)  we  obtain  the  following 
expressions  for 


^ ‘kj'kj  * 1“>  %'ii'kj  ■ 

• ‘‘^’trace  * ' I")  ‘'>Srac.' 

, 3u,  3u.  2 - 3ui.  2 


(2.10) 


Eq.  (2.10)  shows  that  the  heat  dissipation  function  ^ is  always  positive 
for  a stress  tensor  of  the  form  (2.8). 

The  term  /ujF.dV  in  eq.  (2.6)  is  the  contribution  of  viscous  and 
body  forces  to  tne"^  changes  of  the  kinetic  energy  k.  The  force  (per  unit 
volume)  Fj  is  a sum  of  body  forces  pXj  and  viscous  forces  Tj . TTie  latter 
can  be  expressed  in  terms  of  the  viscous  stress  tensor  Tj^- . We  thus 
have  the  equation  ^ 


$ u.F.  = <6+  u.T.  + pu.X.  ■ 
^ 11  ^ 11  11 


T (u.T..)  ♦ pu.X. 

3x.  '•  1 Jk"  11 


(2.12) 


In  this  form  we  have  subdivided  the  contributions  of  forces  to  the 
changes  of  the  total  internal  energy  into  contributions  by  viscous  and 
by  body  forces.  The  corresponding  volume  integral  in  eq.  (2.6)  is 

J*($*  UjF^)dV  = J{$  UjTj  ♦ PUjX.)  dV  * 

* ^ f (2.13) 


We  note  that  the  surface  integral  in  eq.  (2.13)  contributes  to  the  inter- 
nal as  well  as  to  the  kinetic  energy  of  the  flow.  It  represents  the 
viscous  forces  acting  on  the  surface  of  the  control  volume.  The  volume 
integral  over  the  body  forces  contributes  to  the  kinetic  energy  only. 


The  governing  equations  (2.1),  (2.2)  and  (2.4)  through  (2.6) 
also  be  expressed  in  differential  form  as  follows 

H - ~ (P  U.)  = 0. 


It  ^ “k“j^  * ^ ' 

It  (P  k)  * ^ (p  k u.)  > u.|£-  = u.F.  . 


3 


3u. 


It  (p  ^ (p  ® Uj)  ♦ p 33^'-“  ^ 


3x. 

J 


It  ^ air  [p(®*’')“3]  * af-  (P  • Q-aT^  * '^j^j  • 


can 

(2.14) 

(2.15) 

(2.16) 

(2.17) 

(2.18) 


Eq.  (2.16)  is  a consequence  of  eqs.  (2.14)  and  (2.15),  because  k is 
defined  by  eq.  (2.3).  Also,  eq.  (2.18)  is  the  sum  of  eqs.  (2.16)  and 
(2.17).  We  have,  therefore,  only  five  independent  differential  equa- 
tions for  the  six  quantities  p,  U]^,  p,  and  e.  To  complete  the  system 
of  equations  we  need  another  equation,  which  is  provided  by  the  equa- 
tions of  state  for  the  fluid  under  consideration.  We  assume  that  such 
an  equation  is  available,  e.g.,  in  the  form 


0(e,p,p)  » 0 , 


(2.19) 


1 


and  that  eq.  C2.19)  can  be  solved  explicitly  for  either  of  the  argu 
ments.  For  example,  in  case  of  an  ideal  gas  with  constant  specific 
heats  eq.  (2.18)  is 


£..  (Y-l)e  = 0 . (2.20) 


For  the  discussions  in  the  rest  of  this  paper  we  will  not  make  use  of 
eq.  (2.19)  or  (2.20).  The  assumption  of  the  existence  of  such  an  equa- 
tion is  made  here  only  to  close  the  set  of  governing  equations. 


3.  APPROXIMATE  GOVERNING  EQUATIONS  FOR  DUCT  FLOWS 

In  this  section  we  derive  approximate  governing  equations  for  duct 
flows.  The  dominant  component  of  such  flows  is  usually  in  the  axial 
direction.  Also,  in  many  cases  only  the  dependence  of  flow  properties 
on  the  axial  coordinate  is  of  practical  interest.  Duct  flows  are  there- 
fore usually  treated  by  one-dimensional  equations  which  are  derived 
from  the  general  flow  equations  of  Section  2. 

A standard  procedure  for  the  derivation  of  these  equations  is  to 
consider  a control  volume  which  consists  of  a length  Az  of  the  duct. 

Hie  integral  forms  of  the  governing  equations  are  applied  to  this 
control  volume  and  corresponding  differential  equations  obtained  by 
letting  Az  approach  zero.  This  method  is  used,  e.g.,  in  References  4 
and  5,  and  we  will  follow  these  references  closely. 

Another  possible  approach  is  to  start  with  governing  equations  for 
one-dimensional  flow,  i.e.,  a flow  which  depends  on  only  one  coordinate 
and  which  has  a velocity  component  in  the  direction  of  that  coordinate 
only.  Three  dimensional  effects,  e.g.,  from  the  wall  friction,  are  then 
added  to  the  equations  by  ad  hoc  procedures.  We  will  not  pursue  this 
approach  here  because  the  former  approach  can  be  generalized  more 
easily. 


Shapiro,  The  Dimamias  and  Thermodynamioe  of  Compressible  Fluid 
Wlcni,  Vol.  I and  II,  Roland  Preaa  Compcmy,  New  York,  1964. 

5l.  Croooo,  "One-Dimeneional  Treatment  of  Steady  Gaa  Dynamioa"  in 

Pundamentala  of  Gaa  Dynamioa  II,  edited  by  H.W.  Ermona,  Princeton  '» 

Univeraity  Preaa,  1958, 
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Let  z be  the  axial  coordinate  and  let  for  simplicity  the  cross- 
sectional  area  A of  the  duct  be  constant.  The  continuity  equation  (2.1) 
is  then  for  the  control  volume 


(3.1) 


The  bars  on  P and  u in  eq.  (3.1)  indicate  that  we  are  dealing  with 
average  density  and  velocity,  respectively.  We  apply  now  the  mean  value 
theorem  to  the  first  term  in  eq.  (3.1)  and  use  a Taylor  series  expansion 
for  the  second  term.  The  result  is 


{p(z)  A)  • Az  ♦ At  • A =•  0(Az2)  , (3.2) 


where  z s.  z z * Az.  Letting  Az  in  eq.  (3.2)  approach  zero  we  obtain  the 
continuity  equation 


3p  3(3  u) 
at  az 


0 . 


(3.3) 


The  momentum  balance  equation  is  considered  in  the  z-direction  only. 
First  we  obtain  as  above  from  eq.  (2.2) 


{?(£)  a(z)  A}*Az  + Az*A  ^ ^2*A  ■ F*A-Az  ♦ O(Az^)  . (3.4) 


The  momentum  equation  for  the  average  flow  properties  is  obtained  from 
eq.  (3.4)  by  letting  Az  approach  zero.  The  result  is 


_3 

at 


(3  0)  > -jI-  (3  0^)  - 


az 


F . 


(3.S) 
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Hie  force  per  unit  volume,  7,  can  be  expressed  as  a sum  of  two  compo- 
nents in  analogy  to  eq.  (2.11).  The  momentum  equation  is  then 


^ (P  Q)  ♦ 05  ♦ H ■ ? X ♦ T . (3-6) 


The  quantity  T in  eq.  (3.6)  is  obtained  from  the  resultant  of  the  viscous 
boundary  forces  on  the  surface  of  the  control  volume.  For  simple  tubes 
T can  be  expressed  in  terms  of  the  pipe  friction  coefficient  or,  by  exper- 
imental correlations^  ^n  terms  of  the  surface  roughness  and  perimeter  of 
the  tube.  The  term  p X usually  represents  the  gravity  force  component  in 
the  axial  direction  of  the  tube. 

A combination  of  eqs.  (3.3)  and  (3.6)  yields 


(i  P ♦ al  (i  P “ H • ^ “ S p X ♦ u T . (3.7) 


Eq.  (3.7)  can  be  considered  as  an  equation  for  the  kinetic  energy,  if 
the  latter  is  approximated  by 


I - 


(3.8)- 


However,  eq.  (3.7)  is  a mathematical  consequence  of  the  continuity  and 
momentum  equations,  i.e.,  eqs.  (3.3)  and  (3.6),  and  is  independent  of 
any  assumptions  about  the  kinetic  energy. 

Next  we  consider  the  energy  balance.  Following  general  practice 
(see,  e.g..  Reference  4)  we  start  with  the  eq.  (2.6)  instead  of  using 
the  first  law  of  thermodynamics,  i.e.,  eq.  (2.5).  For  the  control  vol- 
ume we  obtain  first 

d 3 

{p*(e-*'lc)*A}*Az  ♦ dz*A  {p*u.(e+T)  ♦ u*p}  ■ 

■ (Q  - |^)’A*dz  ■••  ( $♦  u*F)*A*Az  O(Az^)  (3.9) 
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At  the  liait  &z  0 eq.  (3.9)  yields  the  energy  equation 

(5  (S^lc)}  ♦ I7  « n (*♦£)  ♦ n p}  - 

(3-10) 

The  equation  of  state,  such  as  eq.  (2.19),  containsjjsually  the 
internal  energy  e and  not  the  total  ijitemal  energy  e IT.  Therefore, 
eq.  (3.10)  is  modified  to  eliminate  k.  To  this  end  it  is  assumed  that 
the  approximation  (3.8)  holds,  and  eq.  (3.7)  is  subtracted  from  eq. 
(3.10).  The  result  is 


li  (5  » • I7  (5  0 « • F H ■ Q - H ♦ $ • 


Eq.  (3.11)  is,  of  course,  the  first  law  of  thermodynamics  and  could  have 
been  obtained  directly  from  eq.  (2.5)  without  any  assumptions  about  the 
kinetic  energy . 

In  order  to  use  eqs.  (3.3),  (3.7),  and  (3.nj_  for  computations  we 
need  among  other  data_estimates  for  the  forces  p X and  T and  for  th£  heat 
dissipation  function^.  The  latter  is  often  expressed  in  terms  of  T by 
the  following  arguments.  (See,  e.g..  Reference  4,  page  39  ff.  and 
972  ff.) 

The  last  integral  on  the  right  hand  side  of  the  energy  equation 
(2.6)  is  according  to  eq.  (2.13) 


W ■ / ($♦  u.F.)dV  ■ ^T..u.n.dS  ♦ / p u X.dV  . (3.12) 

J J J ^ J J 


The  surface  integral  in  eq.  (3.12)  represents  the  work  rate  of  viscous 
forces  acting  on  the  surface  S of  the  control  voliune.  We  subdivide  this 
surface  into  material  boundaries  (e.g.,  duct  walls)  and  flow-through 
surfaces  Sq.  The  work  done  on  material  boundaries  is  called  shaft  work. 
The  work  by  viscous  forces  on  the  flow-through  boundaries  is  called 
shear  work.  Let  the  corresponding  work  rates  be  and  Wq,  respec- 

tively. In  these  terms  eq.  (3.12)  is 


The  integral  Wq  over  the  open  boundaries  is  usually  assumed  to  be  negli- 
gible. For  example,  in  case  of  a tube  flow  it  is  argued  that  integration 
over  the  core  flow  region  contributes  very  little  to  the  integral  because 
is  small  in  that  region.  Integration  over  the  boundary  layer  region 
also  contributes  little  because  the  velocity  Uj  is  small  in  the  boundary 
layer.  Hence  Wq  must  be  small. 

If  we  carry  out  the  derivation  of  eq.  (3.10)  using  the  relation 
(3.13)  we  obtain 

|^[5  (8>k)]  ♦ |7[5  0 (e>k)  - n Pl  . Q - II  ♦ ♦ W^  ♦ p n X . (3.14) 


Combining  eqs.  (3.14),  (3.7),  and  (3.8)  we  obtain  as  the  first  law  of 


theyrmodynamics  instead  of  eq.  (3.11)  the  equation 

Ij.  (5  i)  . (5  D 5)  . T f ■ Q - .IJo  - = T . (3.15) 

In  this  equation  the  heat  dissipation  function  $ is  approximated  by 


W , ♦ 

shaft 


-nT. 


(3.16) 


At  the  material  boundaries  the  velocity  of  the  fluid  is  equal  to  the 
velocity  of  the  boundary.  Therefore,  Wghaft  is  non-zero  only  if  the 
boundaries  are  moving.  If  the  tube  does  not  contain  moving  boundaries 
and  Vq  is  neglected,  then  eq.  (3.16)  becomes 


$ « -0  T,  (3. 17) 


which  is  the  usual  approximation  of  $ for  tube  flows  (Reference  4, 
page  972  ff.) 

In  case  of  two-phase  flows,  e.g.  particles  submerged  into  the  fluid, 
*^shaft  i®  assumed  £o  be  the  work  of  drag  forces.  I^et  the  average  par- 
ticle velocity  be  Upaj^icle  ^ke  drag  force  be  Then 


The  resultant  T of  the  viscous  forces  is  in  this  case  the  sum  of  par- 
ticle drag  and  wall  friction  forces 


T-  T 


drag 


wall 


(3.19) 


The  equation  for 


$ becomes  then 


$ * ^particle  " ^drag  ” “ ^wall  * 


(3.20) 


This  equation  is  sometimes  modified  by  an  ad  hoc  factor,  see  Reference  6, 
page  81. 

In  summary,  either  eq.  (3.17),  or  eq.  (3.20)  provides  a convenient 
estimate  for  $ i.e.,  for  the  rig^t-hand  side  of  the  energy  equation 
(3.15).  Estimates  of  and  are  also  needed  to  express  the 

terms  on  the  right-hand  sides  of  tne^momentiun  equation  (3.6).  It  appears 
from  the  derivation  that  no  further  estimates  of  flow  properties  are 
needed  under  quite  general  conditions. 

Some  limitations  of  the  approximation  (3.17)  become  obvious  if  we 
consider  non-steady  fluctuating  flows.  In  such  flows  it  is  possible 
that  the  signs  of  T and  u are  temporarily  equal.  In  these  cases  Wg 
cannot  be  neglected,  because  otherwise  we  would  have  a negative  heat 
dissipation  function.  Thus  it  seems  appropriate  to  ask  how  accurate  is 
the  energy  equation  (3.15).  Our  derivation  does  not  provide  any  clues 
to  an  answer  to  this  question.  We  will  therefore  rederive  the  duct  flow 
equations  more  carefully  in  the  next  section,  keeping  track  of  all  approx- 
imations involved. 


4.  PRECISE  GOVERNING  EQUATIONS  FOR  DUCT  FLOWS 

In  this  section  we  will  derive  complete  one-dimensional  governing 
equations  for  flows  through  constant  area  ducts,  including  formulas  for  \ 

quantities  which  were  neglected  in  Section  3.  We  will  then  discuss  the  i 

differences  between  the  complete  equations  and  those  of  the  previous  ! 


^G.B.  Vallia,  One-Dimenaional  Tuo-Phaae  Flow,  MoGtvw-EiVL  Co,  New  York, 
1969. 
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section,  indicating  where  the  previous  derivation  of  the  equations  is 
insufficient. 

One-dimensional  duct  flow  equations  are  relations  between  average 
flow  properties.  The  equations  depend  therefore,  among  other  things,  on 
the  definitions  of  the  averages.  For  steady  duct  flows  certain  averages 
and  corresponding  governing  equations  have  been  discussed  by  Crocco  in 
Reference  5.  Because  the  averages  defined  by  Crocco  cannot  be  used  for 
non-steady  flows,  our  analysis  will  be  different.  The  results  of  our 
analysis  can  be  applied  to  steady  as  well  as  non-steady  flows. 

First  we  consider  the  continuity  equation  (2.1).  For  a control 
volume  which  consists  of  a length  of  of  the  duct,  eq.  (2.1)  is 


x,»Ax-  _ 

3 j j 

It  I • i'  V' 

, A La  Jx, 


(4.1) 


*3  3 

The  integrals  jpds  and  /pu3ds  are  functions  of  X3.  We  expand  these 
functions  in  Taylor  series,  intechange  the  order  of  integration  over  X3 
and  differentiation  with  respect  to  t in  the  first  term  of  eq.  (4.1), 
and  apply  the  mean  value  theorem  to  that  term.  The  result  is 


= 0(^X3^)  . (4.2) 


with  0 < '0  < 1 . 


At  the  limit‘'Ax3  -►  0 eq.  (4.2)  yields 


If  ( p ^ ° • 

A 3 A 


(4.3) 


We  now  define  for  each  cross-section  X3  * const,  an  average  fluid 
density  p by 

P » j / p ds 
A 
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and  an  average  fluid  velocity  u by 


Ujds 


(4.5) 


The  continuity  eq.  (4.3)  can  then  be  expressed  in  terms  of  the  average 
density  and  velocity  as 


iS. 

3t 


+ 


(4.6) 


Eq.  (4.6)  is  identical  to  the  continuity  equation  (3.3).  However, 
we  have  now  established  that  th£  continuity  equation  is  of  this  form 
only  if  the  average  quantities  p and  u are  defined  by  eqs.  (4.4)  and 
(4.5),  respectively.  Thus,  if  we  chose  an  alternate  definition  of  the 
average  velocity,  e.g.,  the  simple  spatial  average 

“S’  = ^ / u^ds  , (4.7) 

, A 

then  the  corresponding  continuity  equation  would  be 


f 'lii;  =)' • <“•« 

The  right-hand  side  of  eq.  (4.8)  is  non- zero  in  general. 

The  momentum  equation  (2.2)  yields  for  a duct  flow  in  analogy  to 
eq.  (4.3) 


If  the  flow  is  axially  symmetric,  then  for  k ■ 1 and  k ■ 2 all  terms 
in  eq.  (4.9)  are  identically  zero.  In  cases  of  non-axisymmetric  flows 
all  three  momentum  equations  are  needed  to  describe  the  flow,  e.g.,  in 
the  case  of  a non-vertical  tube  in  a gravity  field.  We  will  consider 


for  simplicity  only  the  momentum  equation  in  the  axial  x^-direction, 
thus  restricting  the  analysis  to  axisymmetric  flows.  Eq.  (4.9)  thus 
becomes 


It  i » “3'^  * air  i * i Hr**  ■ ( '’3'^  ■ 

A 3 A A 3 A 


(4.10) 


Eq.  (4.10)  contains  two  new  flow  variables  for  which  averages  have  to 
be  defined.  We  chose  the  following  definitions: 


p - i /p  ds 
^ A 


(4.11) 


1^3  - ^ / Fjds 
A 


(4.12) 


Expressing  the  momentum  equation  (4.10)  in  terms  of  average  quantities 
we  then  obtain 


1^  U)  * ^(P  u^)  * |E-  . -Fj  * 


(4.13) 


b ^ H 


p u^dsj 


A Pi' 


(4.14) 


/ P ds  / p 
A A 


Coniparing  the  momentum  equation  (4.13)  with  the  corresponding 
eq.  (3.5),  we  see  that  the  latter  equation  is  in  error.  The  reason  for 
this  error  is  that  eq.  (3.4)  should  have  contained  the  term  dz*C|n*A.  Eq. 
(4.14)  shows  that  this  term  is  non-zero  in  general.  The  expression  in 
square  brackets  in  eq.  (4.14)  is  negative  or  zero  according  to  Schwarz's 
inequality.  It  is  zero  if  and  only  if  U3  - const,  across  the  duct. 

Hence  the  correction  term  Cg,  is  zero  only  in  case  of  a slug  flow  or  if 
the  term  is  independent  of  X3.  The  latter  is  the  case  for  steady  incom- 
pressible flows  through  constant  area  ducts.  In  all  more  interestin'^ 
cases  Cg  is  non- zero  and  its  magnitude  should  be  estimated  to  justity 
the  neglect  of  Cg,  or  Cg  should  be  included  in  the  momentum  equation. 


The  average  force  per  unit  volume,  F3,  is  defined  by  eq.  (4.12). 
For  later  reference  we  note  that  according  to  equation  (2.11)  F3  is  a 
sum  of  body  forces  and  viscous  forces.  We  define  the  corresponding 
averages  by 


and 


(4.15) 


(4.16) 


With  these  definitions  we  have 


Pj . 5X3 . T3 . ("■”) 


We  now  consider  the  first  law  of  thermodynamics,  eq.  (2.5).  First 
we  obtain  for  the  duct  flow  in  analogy  to  eq.  (4.9) 

^ p e ds  + 3!“^  P ® P 3^  ds  = ^ (Q  - -5^)  ds  + ^ <I)ds  . (4.18) 

A A “a  a a 

In  order  to  express  this  equation  in  terms  of  averages  we  define 


and 


e = ^ / P e ds  , 
^ ^ A 

A A 


(4.19) 


(4.20) 


(4.21) 


With  these  definitions  eq.  (4.18)  becomes 


it 


where 


,1-  (p  e u) 


(4.22) 


9u  1_  ( 
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(4.23) 
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(4.24) 


The  nature  of  the  correction  terms  Cg^  similar  to  that  of  the 

correction  term  in  the  momentum  equation.  They  are  zero  for  slug  flow 
and  should  be  estimated  in  other  cases.  If  we  compare  eq.  (4.22)  with 
the  corresponding  eq.  (3.11),  we  see  that  the  latter  is  in  error.  The 
reason  for  the  error  is  an  oversight  of  a term  Az(Cgj  + Cg2) • A which 
should  have  been  introduced  in  eq.  (3.9).  The  correction  terms  enter 
the  equations  because  a product  of  function  averages  is  in  general  not 
equal  to  the  average  of  the  product  of  the  functions.  Or,  differently 
expressed,  multiplications  of  functions  and  averaging  of  functions  are 
not  commutative  operations. 

We  mentioned  in  Section  3 that  $ is  usually  approximated  by  -u  T3. 

It  was  also  shown  that  such  an  approximation  is  based  on  the  assumption 
that  a term  Wq  can  be  neglected.  We  will  now  investigate  the  approxima- 
tion more  carefully.  By  the  definition  (4.20)  we  have 


■ A 3 ^kj  3x  ■ 

A ^ 
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(4.25) 


The  first  term  on  the  right-hand  side  of  eq.  (4.25)  we  recognize  as 
Wq,  i.e.,  the  average  of  the  gradient  of  the  work  rate  of  viscous 
forces_on  the  cross  section  A.  The  second  term  may  be  approximated 
by  -u  T3.  The  final  formula  for  ^ is  then 


where 


and 


$ . - n Tj  . » c,  , 
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(4.26) 


(4.27) 


(4.28) 


Combining  eqs.  (4.26)  and  (4.22),  we  obtain  for  the  energy  equation 
(first  law  of  thermodynamics)  the  expression 


(5  5)  . jf:  (5  e D)  . p II: . n - u T3. 
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The  first  two  correction  "terms,  Cgj  and  C 2*  appear  in  eq.  (4.29) 
because  of  the  averaging  of  some  terms.  Th£  las^^o  correction  terms, 

Wq  and  C$,  are  due  to  the  approximation  of  ^ by -u  T3. 

The  equation  for  the  kinetic  energy  can  be  treated  formally  in 
the  same  manner  as  the  equation  for  the  internal  energy.  One  can  intro- 
duce error  terms,  corresponding  to  and  Cg2,  either  in  the  kinetic 
energy  equation  or  in  the  equation  for  the  total  internal  energy. 

Since  typically  only  the  equation  for  internal  energy  is  needed  for 
computations,  the  other  equations  are  not  formally  derived. 

In  summary,  we  have  shown  that  the  one-dimensional  governing 
equations  for  average  flow  properties  in  duct  flows  are  not  the  same  as 
equations  for  locally  one-dimensional  flows.  If  the  medium  is  compressible 
then  the  additional  terms  in  the  governing  equations  vanish  only  for  slug 
flow.  For  other  flows  the  magnitudes  of  the  terms  should  be  estimated  for 
each  case  to  check  their  significance.  Formulas  given  in  this  section  may 
be  used  for  that  purpose. 


If  the  duct  is  axially  symmetric,  it  is  more  convenient  to  use 
cylindrical  coordinates  than  the  cartesian  coordinates  of  this  section. 
We  give  therefore  in  Appendix  A all  pertinent  formulas  in  cylindrical 
coordinates . 


5.  EXAMPLES  OF  TUBE  FLOWS 


S.l  Incompressible  Steady  Flow  Through  Cylindrical  Tubes 


In  the  case  of  an  incompressible  steady  duct  flow  the  flow  velocity 
is  constant  along  the  duct  and  dependent  on  the  radial  coordinate  r only. 
Also,  only  the  axial  coordinate  u of  the  velocity  is  non- zero.  There- 
fore, of  all  the  correction  terms  given  in  Appendix  A,  only  C*  can  be 
non-zero  in  this  case.  It  is  given  by  eq.  (A. 35),  which  reduces  to 


(S.1.1) 


The  shearing  stress  Tp^Cr)  is  in  the  present  case  a linear  function 
of  r.  This  is  a consequence  of  the  second  momentum  equation  (A. 13) 
which  reduces  to 


|E  = 1 1_  (r  T ) . 
3z  r 3r  rz"^ 


(5.1.2) 


The  left-hand  side  of  eq.  (5.1.2)  is  constant.  Therefore,  tj.,  must  be 
linear  in  r: 


(5.1.3) 


Substituting  eq.  (5.1.3)  into  eq.  (5.1.1),  we  obtain 


C ® u T - - 
* 2 


u r dr 


0 - 2 n t^^CR)  i . 


C5.1.4) 


J 


TTije  average  shear  stress  is,  according  to  eqs.  (A.  17)  and  (A. 21), 

R 

/ 

R"  o 


/ If  "rz^  ^ “ 2t^^(R)  i . (5.1.S) 


Substituting  eq.  (5.1.S)  into  eq.  (5,1.4)  we  see  that  the  correction 
term  is  zero. 

Hence  the  average  flow  equations  are  exact  for  incompressible 
steady  flows  through  circular  tubes.  This  is  essentially  a consequence 
of  eq.  (5.1.3)  and  the  result  is  valid  for  either  turbulent  or  laminar 
flows.  Also,  we  have  not  made  use  of  Stokes  equations  for  the  stress 
tensor,  nor  made  any  assumptions  about  the  viscosity  of  the  fluid. 

5.2  Lagrange's  Interior  Ballistics  Flow 

As  an  example  for  non-steady  tube  flows  we  consider  Lagrange's 
approximation  to  interior  ballistics  flow  (Reference  7).  The  approxi- 
mation is  obtained  by  postulating  that  the  average  axial  velocity  u 
of  the  gas  in  a gun  tube  is  at  any  time  a linear 
distance  z , i . e . , 


P 

where  Zp(t)  and  = dZp/dt  are  the  location  and 
jectile,  respectively.  ^'We  assume  that  the  local 
as  well  as  radial  components  which  may  depend  on 
coordinate  r. 

Some  consequences  of  the  assumption  (5.2.1) 

/^pendix  B.  In  summary,  the  discussion  shows  that  this  assumption, 
complemented  with  a second  Lagrange's  assumption 


function  of  the  axial 


(5.2.1) 


velocity  of  the  pro- 
velocity can  have  axial 
z,  t,  and  on  the  radial 


are  discussed  in  the 


z (0) 

‘’o  z^(t)  ’ 


(5.2.2) 


Comer t Theory  of  the  Interior  Ballietias  of  Guns,  John  Wiley  and 
Sons,  Sew  fork,  1960. 
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is  consistent  with  the  average  continuity  equation  (4.6)  for  the  flow. 
One  can  also  assume  that  the  local  velocity  vector  has  the  form 


<5.2. 3) 


For  any  reasonable^  functions  u(z,t)  and  f(r)  one  can  determine  corres- 
ponding funtions  v(z,t)  and  h(r)  such  that  the  local  continuity  equa- 
tion is  satisfied.  (The  necessary  formulas  are  given  in  Appendix  B.) 
However,  a flow  characterized  by  eqs.  (5.2.1)  through  (5.2.3)  in  general 
does  not  satisfy  the  local  momentum  equations  if  constant  viscosities 
are  assumed.  Hence  Lagrange's  approximation,  (5.2.1)  and  (5.2.2),  and 
a local  velocity  field  of  the  type  (5.2.3)  can  be  consistent  only  for 
inhomogeneous  media,  i.e.,  media  with  variable  viscosity. 

Because  an  exact  solution  of  the  viscous  tube  flow  equations  is 
not  available,  we  cannot  obtain  exact  values  for  all  correction  terms. 
However,  the  correction  term  in  the  momentum  equation  is  independent 
of  the  viscosities  and  can  be  computed  exactly  for  any  flow  profile. 

In  contrast,  the  correction  terms  in  the  energy  equation  can  be  computed 
only  if  additional  information  is  available  about  the  stress  tensor  and 
the  internal  energy  profile.  These  terms  we  will  estimate  by  computing 
their  values  for  constant  viscosities  and  for  a number  of  "reasonable" 
flow  profiles.  We  expect  by  such  calculations  to  obtain  at  least  order- 
of-nEagnitude  estimates  of  the  correction  terms. 

Particularly  we  will  consider  flow  profiles  of  two  types.  First 
we  will  assume  a flow  field  which  is  described  by 


and 


u(r,z,t) 


Zp(t) 


^[‘-(1)1 


> (5.2.4) 
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This  flow  field  has  a Hagen-Poiseuille  profile  for  n = 2.  For  larger 
values  of  n it  approximates  turbulent  flow  profiles  or  profiles  with 
thin  boundary  layers. 

As  a second  example  we  will  consider  a flow  profile  which  approx- 
imates the  universal  profile  for  steady  turbulent  tube  flow. 
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The  flow  field  defined  by  eq.  (5.2.4)  satisfies  the  local  continu- 
ity equation,  if  the  density  is  given  by  eq.  (5.2.2).  We  note  the  inter 
esting  fact  that  local  continuity  requires  the  radial  flow  component  to 
be  directed  toward  the  center  of  the  tube.  This  is  due  to  the  higher 
mass  flow  rate  at  the  center  and  due  to  the  assumed  increase  of  the 
average  axial  velocity  u(z,t)  with  z. 

The  correction  term  of  the  momentum  equation  is  given  for  our 
flow  by  eq.  (B.48) 


Substituting 


f^r 


dr 


3 /■  2. 


into  eq.  (5.2.5),  we  obtain 


C 

m 


(5.2.5) 


(5.2.6) 


(5.2.7) 


The  momentum  equation  is 


therefore  in  terms  of  the  average  axial  velocity 


3(P  a)  ^ 

3t 


(5.2.8) 


Eq.  (5.2.8)  shows  that  in  the  case  of  a Hagen-Poiseuille  profile  the 
momentum  transport  term  in  the  momentum  equation  should  be  increased  by 
about  33%.  Even  for  a rather  flat  profile  with,  say,  n = 10  the  correc- 
tion term  is  9%  in  this  example. 

The  first  correction  term  Cgj  of  the  energy  equation  is  zero  in 
our  example  because  the  density  p is  independent  of  r and  z.  (See 
Appendix  B for  a discussion  of  this  term.) 


The  second  correction  term  Cq2  of  the  energy  equation  is  (see  eq. 
(B.S4)) 
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(5.2.9) 


In  order  to  compute  this  term  we  would  need  to  make  an  assumption  about 
the  internal  energy  function  e.  For  the  present  discussion  we  will  not 
maVe  any  assvunptions  and  leave  eq.  (5.2.9)  unchanged. 

The  average  heat  dissipation  function  which  appears  on  the  right 
hand-side  of  the  energy  equation  can  be  computed  by  the  formulas  (B.56) 
and  (B.57).  The  result  of  the  computation  is 


$ = 2ua2 


1 (n>2)‘ 
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The  equation  for  the  average  internal  energy  (first  law  of  thermo- 
dynamics) is  in  our  case 


Hp.  e)  + i. 


3t  • ■ 3z  Cp  ® ^ P ll  = ^ ^ * ^e2 
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Substituting  eqs.  (5.2.9)  and  (5.2.10)  into  eq.  (5.2.11),  we  obtain 
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(5.2.12) 


In  eq.  (5.2.12)  we  have  included  the  correction  term  Ce2 
energy  flux  term  on  the  left  hand  side.  It  is  readily  apparent  from  the 
form  of  the  term  that  the  correction  is  zero,  if  the  specific  internal 
energy  e is  independent  of  r. 
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In  the  heat  dissipation  function  on  the  right-hand  side  of  eq. 

(5.2.12)  the  term  with  the  factor  (R/z)^  can  generally  be  neglected, 
because  (R/z)^  is  of  the  order  10-^.  (R/z  is  large  in  the  vicinity  of  the 
breech,  where  the  one-dimensional  approximation  should  not  be  used  any- 
w^^  The  other  term  in  the  square  brackets  is  usually  replaced  by 
-u  Tj.  If  this  is  done,  then  two  additional  correction  terms  should  be 
included  in  the  equation.  The  general  formulas  for  these  terms  are 
given  by  eqs.  (B.61)  and  (B.62).  They  are  in  our  case 
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(5.2.14) 


The  first  term  u Tj  in  eq.  (5.2.14)  is  according  to  eq.  (B.63) 
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2u  (n+2).  (5.2.15) 


Comparing  eqs.  (5.2.13)  and  (5.2.15) ,_w£  see  that  the  term  Wq  is 
indeed  small  relative  to  the  magnitude  of  u Tj.  In  Section  3 such  a 
ratio  of  magnitudes  was  anticipated  based  on  plausibility  arguments. 

The  total  correction  is  the  sum  of  and  Wq.  Combining  eqs. 
(5.2.13)  through  (5.2.15)  we  obtain  for  the  sum 
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(5.2.16) 


The  right 'hand  side  of  the  energy  equation  (5.2.12)  is  thus 


TT  ♦ $-  H - a T ♦ C.  ♦ 

Z V 0 


H - tr  Tj  (1-a)  , 


(5.2.17) 


where  a is  a relative  correction  which  is  to  be  applied  to  u T2. 
given  by 


It  is 


Szi  . (rV  2 n*4  f u'  3 siA  (S.2.18) 

2n  [zj  3 (n*l)(n+2)  u 4 n*4j  . 


The  second  term  in  this  fo^ula  can  in  general  be  neglected,  because 
(R/z)  is  of  the  order  lO"  . The  first  term  is  zero  only  for  n * 2, 
i.e.,  for  a Hagen- Poiseui lie  flow  profile.  In  this  case  the  shear 
stress  is  a linear  function  of  r,  which  causes  certain  correction  terms 
to  vanish,  as  shown  in  Section  5.1.  For  a flat  flow  profile  with,  say, 
n > 10,  the  relative  correction  is  a = 0.4.  Clearly  such  a 40%  approx- 
iaiation  error  will  be  seldom  tolerable.  Hence  for  fla^  flow  profiles 
and  constant  viscosities  the  approximation  of  $by  -u  Tj  is  not  realis- 
tic for  calculations  in  interior  ballistics. 

The  flow  profile  which  is  defined  by  eq.  (5.2.4)  does  not  have 
the  characteristic  form  of  a fully  developed  turbulent  flow  profile  for 
any  n.  We  may  therefore  ask  whether  the  correction  terms  are  possibly 
smaller  for  such  a profile.  In  order  to  investigate  this  question  we 
approximate  the  universal  turbulent  profile  (see,  e.g..  Reference  8, 
page  512)  by  defining 


■ - 1)’  * • 2 [‘  - 1)'*]  ] • 


(5.2.19) 


The  corresponding  function  h(r)  is 


- 0.456 


(5.2.20) 


Sahliahtingt  Bou 
(4th  Edition),  1960. 
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The  correction  term  Cg,  of  the  momentum  equation  can  now  be 
computed  using  eq.  (S.2.5).  The  result  is 


C - - 0.090  I-  Cp  u2)  . 
m qz 


(5.2.21) 


In  analogy  to  eq.  (5.2.8)  we  conclude  from  eq.  (5.2.21)  that  the 
momentum  flux  term  in  the  momentum  equation  should  be  increased  by  9% 
in  the  present  case. 

Assuming  as  before  constant  viscosities,  we  obtain  for  the  average 
heat  dir  ipation  function 


$ - 2u  u^  ^7  7.840  + (1.699  v * u')^  (5.2.22) 


For  the  product  -u  T2  we  obtain 


Q T - - 2u  n2  iy  f (R)  . 2u  u2  -L-  • 14.364  . 
* R^' 


(5.2.23) 


The  right-hand  side  of  the  energy  equation  (5.2.12)  is  therefore 

R ♦ $ « H - n T^  ^1  - 0.454  +1^^  0.059  |l  ♦ 0.588  (.S.2.. 


Thejerror  which  is  introduced  by  replacing  (j  by  -u  Tj  is  about  45%  of 
|u  Tjj.  As  in  the  previously  treated  case,  such  errors  will  be  seldom 
tolerable. 

We  may  conclude  from  these  examples  that  the  magnitudes  of  correction 
terms  are  essentially  the  same  for  flow  profiles  described  by  eq. (5.2.4) 
as  for  profiles  described  by  eqs.  (5.2.19)  and  (5.2.20).  Using  conven- 
tional tube  flow  equations,  e.g.  from  Reference  4,  for  interior  ballis- 
tics calculations,  one  introduces  errors  in  the  momentum  and  energy 
equations  which  are  of  the  order  of  9-50%  of  several  of  the  terms. 

The  examples  indicate  that  an  investigation  of  magnitudes  of  the 
correction  terms  is  necessary  whenever  average  flow  equations  are  used 
to  describe  non-steady  tube  flows. 


6.  CONCLUSIONS 


\ 


Tube  flow  governing  equations  for  average  properties  differ  from 
one-dimensional  flow  equations.  The  differences  are  caused  by  the  fact 
that  averaging  of  functions  and  multiplication  of  functions  are  not 
commutative  operations.  The  magnitudes  of  the  differences  depend  on  the 
particular  problem.  If  the  unsteady  tube  flow  is  of  a type  which  is 
encountered  in  interior  ballistics,  then  several  terms  in  the  equations 
can  be  in  error  by  up  to  50%. 

One  consequence  of  the  various  correction  terms  in  the  equations  is  1 

that  the  continuity  and  momentum  equations  cannot  be  combined  to  yield 
a simple  equation  for  the  average  axial  velocity  component.  Instead, 
the  original  equation  for  the  average  axial  momentum  component  is  the 
simplest  form.  Correspondingly,  the  energy  equation  should  be  formulated 
for  the  internal  energy  per  unit  volume  instead  of  using  the  specific 
internal  energy. 

The  popular  approximation  of  the  heat  dissipation  function  by  the 
product  of  average  velocity  and  average  shear  stress  is  appropriate  only 
in  the  simplest  cases,  e.g.,  for  steady  flows  or  flows  with  a Hagen- 
Poiseuille  velocity  profile.  In  other  cases  the  approximation  can  be 
off  by  up  to  50%.  In  cases  of  more  complicated  flows  even  the  sign  of 
the  approximation  can  be  wrong.  Hence  the  approximation  should  not  be 
used  unless  one  can  demonstrate  its  validity  in  the  particular  case  of 
application. 

Formulas  for  the  correction  terms  in  the  governing  equations  can  I 

be  derived  for  other  than  simple  tube  flows  following  the  outline  of 
this  paper.  The  derivations  which  are  presented  in  some  engineering  text- 
books neglect  important  first-order  terms.  The  apparent  success  of  the 
inaccurate  equations  for  the  treatment  of  tube  flows  is  probably  due  to 
the  fact  that  the  neglected  terms  are  small  or  vanish  for  steady  flows, 
for  which  most  comparisons  between  calculation  and  experiments  are  made. 
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APPENDIX  A 


FORMULAS  IN  CYLINDRICAL  COORDINATES 

In  Section  2 through  4 a convenient  cartesian  tensor  notation  was 
used  to  derive  all  formulas.  If  the  results  are  to  be  used  for  axially 
symmetric  tube  flows,  then  it  is  more  convenient  to  use  cylindrical 
coordinates.  In  this  appendix  we  express  the  important  formulas  in 
these  coordinates. 

Stokes  equation  for  the  stress  tensor  of  an  isotropic  fluid  can  be 
expressed  in  coordinate  independent  form  as  follows  (Reference  3,  page 
132;  Reference  2,  page  144] 


2 

T ■ 2u  c ♦ (u'  - J w)  div  u**I  , (A.l) 


where  t is  the  stress  tensor,  e is  the  strain  rate  tensor,  u*  is  the 
velocity  vector  of  the  fluid,  and  I is  the  unit  tensor.  The  viscosities 
W and  u'  in  eq.  (A.l)  need  not  be  constant,  i.e.,  the  fluid  under 
consideration  need  not  be  homogeneous.  However,  u as  well  as  u'  must 
be  positive  or  zero.  . 

Next  we  compute  the  work  rate  of  the  viscous  forces  acting  within 
an  arbitrary  volume.  To  this  end  we  compute  the  inner  product  of  the 
viscous  forces  V’t  with  the  velocity  vector  u*  and  integrate  over  the 
volume.  The  result  can  be  expressed  as  follows: 


/ (u*  • (V*t))  dV  * / (u*  • (T*n))  dS  - /$dV  . (A.2)  ‘ 


In  eq.  (A.2)  n is  a unit  vector,  orthogonal  to  the  surface  of  the  volume 
V and  pointing  inward,  and  is  the  heat  dissipation  function  defined  by 


^"^race  = <e">trace  * ' I ^"^race  ' 


Because  div  u*  ■ ^^^trace*  expressed  as  follows: 


Eq.  (A. 4)  corresponds  to  eq.  (2.10)  in  cartesian  coordinates.  Eq. 

(A. 2)  corresponds  to  eq.  (2.13)  in  cases  where  the  body  forces  Xj 
are  absent. 

We  now  express  the  various  quantities  appearing  in  the  equations 
using  cylindrical  coordinates.  Let  the  coordinates  be  r,  and  z. 
Components  of  vectors  and  tensors  we  denote  by  attaching  corresponding 
indexes  to  the  quantities.  Thus,  the  velocity  vector  u*  is 


u*  - (u^.  u^,  u^)  . 


(A.5) 


The  strain  rate  tensor  e has  the  following  components  (Reference  2, 
Page  602) 


(A.  6) 


The  vector  7»t  has  the  components  Ty,  Tij,  and  Tj,  representing  the  vis- 
cous forces  acting  in  the  three  coordinate  directions.  The  components 

are 


(V*T), 


3t  , 3t„.  3t  . 

— 2- + — Li  + — Ei  + i (T  -T 

3r  r 3«(.  3z  r '•  rr  i 


3r 


iflil  + + 2 i T 

r 3^  32  r ri^  * 


T 

z 


r 34 


(A.  7) 
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We  specialize  these  equations  for  the  case  of  an  axisymmetric  flow 
without  swirl  through  a circular  tube.  The  flow  is  then  independent 
of  the  coordinate  and  the  ^-component  of  the  velocity,  u^,  is 
zero.  In  order  to  simplify  the  notation  we  denote  the  non-zero 
velocity  components  as  follows: 


Uj  =*  u(t,z,r) 


(A.  8) 


Uj.  “ v(t,z,r)  . 


Let  R be  the  radius  of  the  tube.  The  average  density  is  then  defined 
by 


2 

P(t,2)  = -T  / P(t.z.r)  r dr  . 


The  average  axial  velocity  is 


R^o 


R 


5(t,z)  = -2 / p(t,2,r)  u(t.z,r)  r dr 


R iy(t,2)  o 

The  local  continuity  equation  is 


II  . i |_  (r  p V)  . (p  u)  = 0 


The  corresponding  equation  for  the  averages  is 


9p  3 

^ ^ ^ CP  0)  = 0 . 


(A.  9) 


(A. 10) 


(A.ll) 


(A. 12) 
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The  local  balance  of  momentuin  is  expressed  by  the  following  two  differ- 
ential equations: 


" 7 I7  " It  f'’  “ " I?  “ ’'r 


* 7 If  (p  r u V)  * (P  u^)  ♦ II  . 


ycA.is) 


The  third  momentuin  equation  is  staisfied  identically  because  of  our 
symmetry  assumptions.  The  right-hand  sides  of  eqs.  (A. 13)  depend  on 
the  strain  rate  tensor  e by  eq.  (A. 7)  and  (A.l).  In  our  case  the  strain 
rate  tensor  has  the  following  components: 


3v 

^rr  ^ 3r  ’ ^ 


1 3u 

= — V , E = -T—  . 

r zz  32  ’ 


‘r»-  0 ■ ■ 


V (A.  14) 


rz 


i ini 

^ 2 |_3z  3rJ  • 


The  divergence  of  the  velocity  vector  is 


f 


1 1 
[ 


Substituting  (A. 16)  into  (A. 17),  we  obtain 


The  Donentua  equation  for  the  averages  is 


4 

where  the  average  pressure  ^ is  defined  by 

2 ^ 

7 ■ “2  / P(t,r,r)  r dr 
R o . 

and  the  average  viscous  force  by 


Tj  ■ % / T^(t,z,r)  r dr  . 


R 0 


The  correction  term  in  eq.  (A. 19)  is 


^m  • hy  ^ j p r dr 


The  equation  for  the  local  internal  energy  is 


ev)  *1^(0  e.u)  . p 1-  (r  v)  ♦ |^J 


(A. 18) 


(A. 19) 


(A.  20) 


(A. 21) 

I 

i 


(A. 22) 


. (A. 23)  I 


r 
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The  heat  dissipation  function  $is  given  by  eq.  (A. 4).  Sub- 
stituting the  strain  rate  tensor  components  from  eq.  (A. 6)  into 
(A. 4),  we  obtain 

The  energy  equation  for  the  average  internal  energy  is 

0 « 0)  • P It  - H . J • C,J  • ' 

triiere 


(A.  24) 


(A.  25) 


2 ^ 

• -rs-  / p e r dr  , 

(A.  26) 

R 0 

R 

/ (Q  - div  q)  r dr  , 

(A.  2 7) 

o 

R 

- / $ r dr  , 

(A. 28) 

0 

R ^ _ 

(A.  29) 

C.J  ■ ft  (P  0 f) 


^jlt  (P  U.)  p 


(A.  30) 
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Ihe  integrand  in  (A. 33)  can  be  obtained  from  eq.  (A. 17).  Carrying 
out  the  substitutions,  we  obtain  for  the  second  correction  term 


C.  - nT  - ^7(vT  ♦uT)rdr» 

* * r 0 ' * 

•“[?  If 


(A.  35) 


* The  separate  expressions  for  and  Wq  might  be  of  interest  for  the 
^scussion  of  approximations.  Usually  C.}  is  neglected  completely  and 
W_  is  assmed  to  be  small  by  plausibility  arguments.  The  total  correc- 
tion, which  is  caused  by  replacement  of  ^ by  -u  Tj,  is  the  sum  of  Wq  and 
C^.  Ihe  sum  is,  of  course. 


¥ +0.  - nT 

0 ♦ z ^ 


■ u T ♦ 


jif-IJ-KK-f-® 

o 

2 ^ 

♦ (w'  - j w)  (div  u*)T  r dr  . 


©1 


(A.  36) 


Eq.  (A. 36)  may  be  more  advantageous  for  actual  calculations  than  (A. 34) 
and  (A. 35)  because  it  does  not  contain  derivatives  of  the  viscosities. 
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LAGRANGE'S  APPROXIMATION  TO  INTERIOR  BALLISTICS  FLOW 

I^  is  plausible  to  assume  that  in  a gun  tube  the  average  axial  vel- 
ocity u(z,t)  of  each  cross-section  is  a linear  function  of  the  distance 
z from  the  breech  of  the  weapon.  Let  Zp(t)  be  the  location  of  the  pro- 
jectile and  Up(t)  = dzp/dt  be  its  velocity.  The  above-mentioned 
Lagrange's  approximation  is  then 


i 

In  the  classical  Lagrange's  approximation  (B.l)  is  supplemented  with 
the  assumption  that  the  gas  density  in  the  tube  is  a function  of  time 
only. 

In  this  appendix  we  shall  investigate  some  consequences  of  these 
assumptions.  Particularly  we  are  interested  in  finding  if  there  is  a 
three-dimensional  viscous  tube  flow  which  satisfies  Lagrange's  assumptions. 

First  we  will  consider  flows  in  which  the  gas  density  is  a separable 
function  of  z,  t,  and  the  radial  coordinate  r: 


p(r,z,t)  » g(r)  • P(z)  • K(t)  . (B.2) 


Later  we  will  specialize  our  considerations  to  the  classical  Lagrange's 
approximation,  where  P(z)  and  g(r)  are  constants. 

We  assume  that  g(r)  is  non-dimensional  and  normalized  by 


%7  g(r)  r dr  - 1 . (B.3) 

R o 


The  product  of  the  other  two  functions  in  eq.  (B.2)  is  then  the  average 
density 


7 ^ 

5(z,t)  ■ P(z)  K(t)  * “T  / p(r.*.t)  r dr  . (B.4) 

R o 


In  eq.  (B.l)  the  variables  z and  t are  already  separated.  We  assume 
that  the  dependence  of  u on  r can  be  separated  also,  such  that 


u(r,z,t)  ■ £(r)  • u(z,t)  . 


(B.5) 


It  was  shown  in^  Section  4 that  a reasonable  definition  of  the  aver- 
age axial  velocity  u in  terms  of  the  local  velocity  u is 


2 ^ 

-=■  / u pr  dr  . 
R o 


(B.6) 


With  this  definition  of  u we  have  the  following  relation  between  the 
nondimensional  functions  fCr)  and  g(r): 

7 R 

~T  / gCr)  r dr  » 1 . (B*7) 

R o 


The  funtions  u(z,t)  and  p(z,t)  satisfy  the  continuity  equation  (4.6), 
i.e. , 


3t 


0 . 


(B.8) 


Substituting  the  product  P*K  for  p into  eq.  (B.8)  and  the  expression 
(B.l)  for  u we  obtain 


P(z)  • K'(t)  > K(t) 


Zp(t)  dz 


0 . 


This  equation  has  solutions  of  the  form 


(B.9) 


(B.IO) 


with  arbitrary  m.  In  eq.  (B.IO)  Pq  is  the  average  density  of  the  gas 
in  the  tube  at  time  t » 0.  For  m « 0 we  obtain  the  classical  Lagrange's 


61 


solution.  More  generally  we  may  assume  p to  be,  e.g.,  of  the  form 

.(o) 


_f2 L 


CB.ll) 


with  arbitrary  m,  Aq  and  For  physical  reasons  m ^ 0,  Aq  ^ 0 and 
A*  ^ -Aq. 

Next  we  investigate  the  radial  velocity  component  v(r,z,t).  The 
local  continuity  equation  is 

It  ^ l|_(r  p V)  > w)  > 1^  (p  u)  = 0,  (b.12) 


where  w is  the  angular  velocity  component.  Let  w « 0 (no  swirl)  and 


V ■ V(z,t)  • h(r)  . 


(B.13) 


Eq.  (B.12)  can  then  be  expressed  by 


it . 


-.Id 


^ • g(r)  ♦ P V - ^ [r  g(r)  h(r)]  ♦ g(r)  f(r)  [p  U]  ^ 0 . (B.14) 


Eliminating  3p/3t  from  eq.  (B.14)  with  the  aid  of  eq.  (B.8),  we  obtain 


I-l^f(r)]  g(r)^^^  ♦ p V i g(r)  h(r)]  » 0 . 


This  equation  is  satisfied  by  the  functions 


(B.15) 


v(z.t)  . R i 


and 


h(r) 


R r g 


/ (1-f)  g r dr 


(B.16) 


(B.17) 
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Eqs.  (B.IS),  (B.16),  and  (B.17)  give  the  local  radial  velocity 
v(T,z,t,)  for  any  flow  profile  specified  by  p,  u,  g,  and  f.  Clearly, 
the  factor  h(r)  is  not  normalized  in  the  same  manner  a£  f(r).  There- 
fore, v(z,t]  is  not  an  "average"  radial  velocity.  If  p is  given  by 
eq.  (B.ll),  then 


• i 


(B.18) 


It  is  interesting  to  note  that  v is  not  zero  for  z > 0 and  z ■ Zp.  This 
is  an  indication  that  the  assumption  (B.5)  about  separation  of  variables 
fot  the  axial  velocity  is  not  valid  in  the  vicinities  of  the  breech  and 
the  projectile.  These  regions  we  will  therefore  exclude  from  our  consid- 
erations . 


In  summary  we  have  found  a flow  field  in  a cylindrical  tube  which 
satisfies  the  local  continuity  equation  and  Lagrange's  assumption  (B.l). 
The  flow  field  is  described  by  the  following  functions 


u 


f(r) 


P ■ PCz.t)  • g(r) 


> (B.19) 


V ■ V(z,t)  • h(r)  J 

If  one  specifies  u,  then  F is  given  by  eq.  (B.ll)  and  7 is  given  by 
eq.  (B.18).  The, dependence  of  the  flow  field  on  r can  be  specified  by 
two  functions,  g(r)  and  f(r),  from  which  h(r)  is  then  computed  by  eq. 

(B.17).  The  function  g(r)  has  to  be  positive  for  0 ^r  < R and  normal- 
ized by  eq.  (B.3).  We  assume  also  that  g'(0)  * 0.  The  function  f(r) 
has  to  satisfy  the  conditions 

1 


and 


f'(o)  - 0 

f(R)  - 0 . 


> (B.20) 


i 
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It  is  normalized  by  eq.  (B.7).  Hence  we  have  a total  of  three  condi- 
tions which  restrict  the  choice  of  f'(r). 


Instead  of  specifying  f(r)  we  may  also  specify  h(r).  The  function 
f(r)  is  then  given  by 


f(r)  - 1 - [r  g(r)  h(r)]'  (B.21) 

The  function  h(r)  has  to  satisfy  the  following  four  conditions 


h(0)  - 0 , 
h’'(0)  « 0 , 
h(R)  - 0 , 
h'(R)  - 1/R  . 


(B.22) 


The  flow  field  also  has  to  satisfy  the  momentum  equations.  The 
analysis  of  these  equations  is  more  complicated  because  it  involves,  in 
addition  to  the  velocity  and  density  functions,  the  pressure  function 
p(r,z,t)  and  the  viscosities  v and  u',  which  in  general  are  variable. 

We  have  tried  to  restrict  our  considerations  to  the  special  case  with 
constant  viscosities  and  classical  Lagrange's  approximation  (i.e., 
g(r)  - ID-  We  have  found  that  the  flow  field,  as  defined  by  eq.  (B.19), 
does  not  satisfy  the  momentum  equations  in  this  special  case.  The  for- 
mulas for  correction  terms,  which  we  shall  derive  at  the  end  of  this 
appendix,  are  therefore  to  be  considered  as  approximations  only. 


SC^D  = 1>  then  the  flow  field  is  given  by 


Note  that  according  to  eq.  (B.24)  the  radial  velocity  component  v is 
independent  of  z for  the  classical  Lagrange's  approximation. 


The  divergence  of  the  velocity  u*  of  this  flow  is  (see  eq.  (A. 15)) 

(B.26) 


* 1 3(rv)  3u  3u 

div  U*  • — * + TT  “ TT  “ , 


r 3r 


3z  3z  Zp(t)  ■ 


The  components  of  the  forces  caused  by  the  viscous  stress  tensor  are 
given  by  eq.  (A. 18).  In  our  case,  assuming  constant  viscosities,  we 
obtain  for  the  r-component 


Tj.  - 2u  V i [(r  h')'  - ih]  ♦ U If  f ' 


[-2w  f • + u f • ] If  = - 


(B.27) 


The  z-component  of  the  force  is 


T^  - u [r  f]' 


w z ^ i (f  f ' ] ' 
p 


(B.28) 


Ihe  local  momentum  equations  are  according  to  eqs.  (A. 13)  and  (B.12) 


3v  3v  ■ 3p  _ T 

p n ^ ^ ^ ^ 37  ^r  ' 


^ 3t 


3u  3u  3u  ^ ^ _ ~ 
— 4’  ov  •:r~  + ou  Tr“  * T 
3t  3r  ^ 3z  3z  z 


CB.29) 


(B.30) 


For  a flow  field  described  by  eqs.  (B.23)  through  (B.24)  eq.  (B.29)  is 


(B.31) 
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Differentiating  eq.  (B.31)  with  respect  to  z,  we  obtain 


The  function  p(r,z,t)  is  therefore  of  the  form 

p(r,z,t)  ■ Pj^Cr.t)  P2(z.t) 


(B.  32) 


(B.  33) 


Eq.  (B,31)  might  be  used  to  determine  the  function  p, (r,t)  if  the  other 
terms  in  the  equation  are  given.  ^ 


Eq.  (B.30)  is  in  the  present  case,  i.e.,  for  the  flow  described  by 
eqs.  (B.23)  through  (B.25) 


From  eq.  (B.35)  we  can  conclude  that  the  expression  (?p2/3z)/z  is  inde- 
pendent of  z.  The  various  terms  in  this  equation  are  products  of  func- 
tions of  r and  t and  the  equation  has  the  form 


gj(t)fj{r)  + g2(t)f2(r)  ♦ g^(t)  + f^(r)  * 0 . (B.36) 


Such  an  equation  can  be  satisfied  identically  only  if  either  all  gi(t) 
are  constant  or  all  f^  (r)  are  constant.  The  case  with  all  f^Cr)  « const, 
corresponds  to  a slug  flow  in  which  we  are  not  interested.  Assuming  the 
time  functions  gj^(t)  to  be  constant,  we  obtain  first 

u 


g,  (t)  ■ P - A 


<ls_  1 

P 2 (O)  = 

o p''  dt  z 

P 


Eq.  (B.37)  can  be  integrated  to  yield 


■ 'p"»  PAiT^ 


The  corresponding  velocity  of  the  projectile  is 


u (t)  * :r' 


P P 


o (l-At/p^)‘ 


The  density  as  a function  of  time  is 


pCt)  » A ^ = 
P 


The  first  time  function  in  eq.  (B.35)  is  then 


gj(t)  - P 


!Ld  La 

“p  V.V  V 


Let  the  value  of  the  third  time  function  be  B.  We  obtain  then 


’"2  . ..  “p..  L 


Eq.  (B.35)  takes  now  the  form 


A(f*f^^R  h f)  + B - u i (r  f’)'  “ 0 . 


(B.37) 


(B.38) 


(B.39) 


(B.40) 


(B.41) 


(B.42) 


(B.43) 
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(B.44) 


with  the  constants 


and 


(B.45) 


The  first  factor  in  eq.  (B.45)  is  a Reynolds  number  of  the  projec- 
tile. It  is  typically  of  the  order  10^.  The  first  factor  in  eq.  (B,46) 
for  B*  is  a Poiseuille  number  of  the  flow.  Its  magnitude  is  of  the 
order  lO'^.  The  function  h(r)  has  to  satisfy  the  differential  equation 
(B.44)  and  the  four  boundary  conditions  (B.22).  Since  eq.  (B.44)  is  of 
third  order  only,  the  function  h(r)  will  in  general  not  satisfy  all 
boundary  conditions.  We  conclude,  therefore,  that  Lagrange's  approxi- 
mation is  not  consistent  with  a flow  field  which  can  be  described  by 
separation  of  variables,  eqs.  (B.23)  through  (B.2S). 

In  Section  5.2  we  have  nevertheless  used  this  flow  field  to  obtain 
estimates  of  correction  terms  because  we  were  not  able  to  find  an  exact 
three-dimensional  solution  of  Navier-Stokes  equations  which  is  also 
consistent  with  Lagrange's  approximation. 

Next  we  compute  the  various  correction  terms  for  the  average  flow 
equations  using  the  formulas  of  Appendix  A and  the  flow  described  by 
eq.  (B.19). 


The  correction  term  Cm  of  the  momentum  equation  is  given  by  eq. 
(A. 22): 

R 


C - ^ 

B 


(B.47) 


Substituting  the  expressions  (B.23)  and  (B.25)  for  u and  p respectively 
into  eq.  (B.47),  we  obtain 


■[- 


j r drj  Ij  (5  u^)  . 


(B.48) 


Eq.  ^B.48)  is  of  course  valid  for_ any  functions  u = u (z,t)_*  f(r)  and 
p ■ p(z,t).  In  Lagrange's  case  p is  independent  of  z,  and  u is  linear 
in  z.  We  obtain  in  this  case 


1 r dr|  2zp(t)  ^ 


(B.49) 


The  energy  equation  has  several  correction  terms.  First  we  consid- 

R 


er  the  term  given  by  eq.  (A.  29): 


el 
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dr  . 
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Me  substitute  the  flow  field  formulas  (B.19)  into  this  equation  and 
obtain 

R 

r a.T  “1 

dr 
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This  expression  can  be  transformed  b/  partial  integration  and  some 
algebra  into 


“el 


R 3z 


(r  h)'  p dr 
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(r  h) ' p dr  . 


(B.52) 


Eq.  (B.52)  shows  that  the  correction  term  Cel  ^ following 

cases: 

(a)  g = 1 and  0,  i.e.,  the  classical  Lagrange's  case; 

3p 

(b)  g = 1 and  * 0»  !•«.,  p and  p independent  of  r; 

(c)  h = 0,  or  f = 1,  i.e.,  slug  flow. 


The  second  correction  term  of  the  energy  equation,  Ce2»  depends  on 
the  local  internal  energy.  According  to  eq.  (A. 30), 


C 


e2 


■ 

u e r dr  , 

. 


(B.S3) 


If  the  flow  field  is  given  by  eq.  (B.19),  then 


(B.S4) 


This  correction  term  vanishes  if  the  internal  energy  e is  independent 
of  the  radial  coordinate  r. 


The  remaining  correction  terms,  Wq  and  C^,  in  the  energy  equation 
are  cause^  ^ the  replacement  of  the  heat  dissipation  function  ^ by  the 
product  -u  Tj.  The  heat  dissipation  function  A is  according  to  eq. 

(A. 24) 


(B.55) 


The  correction  term  Wq  is  according  to  eq.  (A. 34) 


2 ( r 3u  _ 3 r 3^  + (yr  . 2 ) JL  fl 

3“'3zzpj 

o ^ 

■ \ ■ f f j r dr  . 


r dr  = 


(B.58) 


The  correction  term  C^-is  given  by  eq.  (A. 35).  In  the  present  case 
with  constant  viscosities  we  obtain  by  substituting  eqs.  (B.27)  and 
(B.28)  into  eq.  (A. 35) 
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livh-^f’  +1111 
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(B.59) 


By  partial  integration  and  using  eq.  (B.21)  we  can  show  that 


/ R h f'r  dr  ■ 
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(B.60) 
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u / (r  f')'  dr  = 2u  z ^ f ' (R)  . (B.63) 


o 


The  correct^o^  termsJVo  and  C,j  vanish  for  a slug  flow.  However 
for  such  a flow  u and  $ are  also  zero.  * 
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ABSTRACT.  A finite  element  procedure  for  linear  stress  analysis  of 
axisymmetric  bodies  subjected  to  torsional  loads  is  developed.  The  formu- 
lation is  based  on  the  assumed  stress  hybrid  model.  Applications  are  made 
to  cylinders  and  cones.  Excellent  agreements  are  obtained  between  the 
exact  solution  and  the  finite  element  results. 

I.  INTRODUCTION.  The  present  study  is  motivated  by  the  consideration 

of  the  stress  analysis  of  artillery  projectiles.  During  firing,  the  projectile 
is  subjected  to  a combination  of  various  loads  which  are  (1)  axial  load- 
due  to  linear  acceleration  of  the  projectile,  (2)  centrifugal  load-due  to 
angular  rotation  of  the  projectile,  (3)  torsional  load-due  to  angular  accel- 
eration of  the  projectile,  (4)  internal  load-due  to  setback  on  H.E.  and  (5) 
external  load-due  to  gun  tube  constraint,  band  pressure  and  balloting.  In 
view  of  the  complexity  of  the  geometry  of  the  projectile,  a finite  element 
analysis  must  be  performed  .in  order  to  determine  the  stresses  and  deforma- 
tions in  the  projectile.  Since  the  projectile  has  an  axis  of  rotational 
symmetry,  it  is  only  logical  that  an  axisymmetric  ring  element  would  model 
it  more  accurately  and  efficiently.  In  a MIT  study  [1],  which  was  performed 
for  AMMRC  under  a contract,  an  axisymmetric  ring  element  was  developed  based 
on  the  assumed-stress  hybrid  finite  element  model.  However,  it  can  only 
treat  axisymmetric  loads  of  the  projectile. 

It  is  the  goal  of  the  present  analysis  to  develop  an  axisymmetric  solid 
of  revolution  element  which  can  be  used  to  determine  the  stresses  and  deform- 
ations in  axisymmetric  structural  bodies  under  torsional  loads.  The  assumed 
stress-hybrid  model  is  employed  to  derive  the  element  stiffness  matrix  such  j 

that  the  results  can  be  combined  with  that  from  the  MIT  study.  • 

1 

Other  finite  element  formulations  for  solution  of  axisymmetric  structural  | 

bodies  under  torsion  can  also  be  made.  The  axisymmetric  quadrilateral  ele-  ; 

ment,  based  on  the  displacement  formulation  in  the  ANSYS  finite  element  program  , 

[2],  can  be  used  for  modeling  axisymmetric  structures  with  non-axisymmetric  ^ 

loadings  such  as  bending,  shear  or  torsion.  Different  finite  element  formu-  f 

lations  have  also  been  developed  for  the  solution  of  torsion  of  nonprismatric  j 

bars  [3,4].  ] 

II.  DERIVATION  OF  ELEMENT  STIFFNESS  MATRIX.  The  assumed-stress  hybrid 

model  is  based  on  a modified  complementary  energy  principle  [5].  It  assumes  j 

compatible  displacements  along  the  interelement  boundaries  and  a stress  field  1 

which  satisfies  equilibrium  within  each  element.  j 
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The  total  complementary  energy  of  one  finite  element  is  given  by 


IT  = i 1,,  S.  » o.  . a,  p dV  - T.u.dA 
c 2 V ijkt  ij  k-t  All 


where 


(1) 


^ijk^ 


V 

A = boundary  area 

= surface  traction 

u.  = prescribed  boundary  displacement 

The  function  ir^  is  a minimum  when  the  stresses  satisfy  the  equilibrium 
condition.  In  deriving  the  element  stiffness  matrix,  the  displacements 
along  each  boundary  of  the  finite  element  are  expressed  in  terms  of  the  nodal 
displacements  ^ and  certain  interpolation  functions  JL,  such  that  the  displace- 
ment compatibility  conditions  with  the  neighboring  elements  are  satisfied. 

u = (2) 

The  element  stresses  in  the  interior  of  the  element  are  then  expanded  in  terms 
of  a finite  number  of  stress  parameters  ^ 

a = P6  (3) 

where  ^ is  chosen  to  satisfy  the  homogeneous  equilibrium  equations.  The 
surface  tractions  can  be  written  in  the  form  of 

T = R6  (4) 

where  R is  obtained  by  applying  the  element  boundary  conditions  to  Substi- 
tuting Eqs.  (2)  to  (4)  in  Eq.  (1)  one  obtains, 

IT  = i - B^Gq  (5) 


= compliance  constants  of  the  material 
= stresses 
= volume 


where 


The  best  approximate  solution  for  6 for  the  problem  is  obtained  by 
setting  to  zero.  The  result  is  ** 

(8) 

From  which , 

6 = H'^G  q 

The  first  term  in  the  expression  of  (Eq.  (1))  represents  the  total  strain 
energy  U in  the  element.  By  substituting  Eq.  (9)  into  U one  obtains: 

_ 1 „T  _T  „-l  _ 

U = :5-qGH  Gq 

^ mm  ^ mm  0^ 

By  definition,  the  strains  energy  can  be  written  as  follows: 

1 T 

U = 7 q K q 

where  K is  the  element  stiffness  matrix. 

00 


(9) 


(10) 


(11) 


By  comparing  Eqs.  (10)  and  (11),  one  obtains  the  element  stiffness  matrix  for 
the  hybrid  stress  model: 


T - 1 
G H * G 


(12) 


III-  FORMULATION  OF  AXISYMMETRIC  SOLID  OF  REVOLDTION  ELEMENT  BY  ASSUMED- 
STRESS  HYBRID  MODEL.  Since  only  structural  problems  in  the  shape  of  body  of 
revolution  are  considered,  an  axisymmetric  solid  of  revolution  element  is 
developed.  For  convenience,  the  cylindrical  coordinates  are  used.  Let  u, 

V and  w denote,  respectively,  the  components  of  displacement  in  the  radial  (r) , 
tangential  (0)  and  axial  (z)  directions.  The  relationships  between  the  com- 
ponents of  displacements  and  the  strain  components  are: 

9u 
“ 3r 


1 
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The  equilibrium  equations  are: 
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(14) 


In  the  application  of  these  equations  to  the  torsional  problem  the 
semi-inverse  method  may  be  used  and  the  components  of  displacements  u and 
w are  assumed  to  be  zero.  It  can  be  shown  that  the  solution  obtained  on 
the  basis  of  such  an  assumption  satisfies  all  the  equations  of  elasticity 
and  therefore  represents  the  true  solution  of  the  problem  [6]  . Substituting 
in  Eq.  (13)  u = w = 0,  and  making  use  of  the  fact  that  from  symmetry  the 
displacement  v does  not  vary  with  the  angle  6,  one  obtains: 


c 
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rz 
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3v  V 

^re  “ 3r  ■ 7 (15) 


3v 

Y s — » 

^ 0z  3z 


Equation  (15)  together  with  Hooke's  law  determines  that  of  all  the  six  stress 
components  o , o , o , x » ^ o*  ^9  ^ 9 ^92  different  from  zero. 

As  a result  Sf  tnis.^the^lirsf  two  of  Eq.  (13)  are  identically  satisfied,  and 
the  third  of  these  equations  becomes: 


0 


(16) 


Hence,  in  the  subsequent  formulation  of  an  axisymmetric  solid  of  revolution 
ring  element  one  is  only  concerned  with  the  displacement  v and  the  equilibrium 
equation  given  by  Eq.  (16) . The  ring  element  has  four  nodes  and  a general 
quadrilateral  cross-section  in  the  r-z  plane  (Fig.  1). 
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Axis  of  Revolution 


General  Quadrilateral  Axisymmetric  Solid  of  Revolution  Element 

Global  System 


A.  Element  Uisplacement  Assumptions 


In  the  assumed-stress  hybrid  model,  the  interpoic-tion  function  must  be 
selected  in  such  a manner  that  it  assures  the  displacement  compatibility 
between  neighboring  elements.  Thus,  for  the  4 node  4 degree -of -freedom  axi- 
symmetric element  the  appropriate  choice  is  a linear  interpolation.  In  terms 
of  the  nodal  quantities  it  assumes  the  following  form: 


S is  the  boundary  coordinate  which  is  normalized  for  each  element  side  such 
that  it  varies  between  -1  and  +1.  v^  and  are  the  values  of  v at  the  nodes 

i and  i+1  with  v^  = v . Eq.  (17)  is^the  interpolation  function  for  the  ith 
side  of  the  element  (l  = 1,  2,  1).  The  value  of  r and  z along  the  ith  side 

can  be  related  to  the  nodal  coordinate  r . , z^  in  the  same  manner  as  the  dis- 
placement interpolation  function;  i.e. 


When  interpolation  of  the  displacements  in  the  interior  of  an  element 
required,  the  following  bilinear  interpolation  function  may  be  used: 


where  v.  is  the  value  of  v at  the  ith  node  of  the  element 
the  following  equation: 


P.  is  defined  by 


C and  n are  related  to  r,z  by  the  following  equations 
4 


z(5.n)  = j 

T 

S P.z. 

. , 11 

1=1 

4 

r(c,n)  = j 

E P.r. 
i=l  " " 

(21) 


Equation  (21)  describes  a coordinate  transformation  between  a general  quadri- 
lateral in  the  r-z  plane  and  a square  (with  side  length  equal  to  2)  in  the 
C-n  plane  (Fig.  2). 
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(-1,-1) 

Fig.  2 Solid  of  Revolution  in  Transformed  ^;-n  System 

B.  Element  Stress  Assumptions 

There  is  no  prior  knowledge  concerning  the  selection  of  proper  stress 
assumptions  except  they  must  satisfy  the  equilibrium  equations  exactly.  In 
the  present  analysis,  the  following  stress  assumption  is  chosen: 


re 


$1  ♦ ^2^  * 


(22) 


"ez  = " ^6" 
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Substituting  Eq.  (22)  into  Eq.  (16),  one  obtains 
^2*^6"  26i  ^ - 202  - 263  I = 0 


(23) 


In  order  for  the  assumed  stress  (Eq.  (22))  to  satisfy  the'  equilibrium  (Eq.  (16)) 
exactly,  the  following  relationships  must  hold: 


302  + 6g  = 0 


26j  = 0 


2B3  = 0 


(24) 


As  a result,  the  stress  assumptions  (Eq.  (22))  becomes 


"re  = 62r 


"z6  = h * - ^®2" 


(25) 


By  comparing  Eqs.  (3)  and  (25),  one  obtains; 


re 


‘ze 


(26) 


6 = 


(27) 
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(28) 


C.  Definition  of  Compliance  Matrix 

The  matrix  ^ in  Eq.  (6)  is  the  compliance  matrix  that  relates  stresses 
a to  strains  e: 


e = s a 


(29) 
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For  an  axi symmetric  element,  the  matrix  H in  Eq,  (6)  may  be  written  as 


ji=  \ £^(r,z)  (r,z)  rdrdedz  (32) 

n 

Using  the  coordinates  transformation  defined  in  Eq.  (21)  and  performing  the 
0-integration  analytically,  the  expression  for  H becomes 

iL=  (C.ri)£X('^»'l)rC5.n)|j|d5dn  (33) 

r and  z can  be  ,>ritten  in  terms  of  c and  n as 

r = a^  + a^c  + a2n  + 

(34) 

z = bp  + bjC  + b2n  + b^Cn 


wh^»TA 


I tI  _ 3(z.r) 

' ' ■ a(C.n) 


= + A2?  + Ajfi 


where 


Ai  = - ajb2 


^3  = ^2^3  • 


E.  Calculations  of  Matrix 


For  the  calculation  of  the  matrix  ^ in  Eq.  (7) , expressions  for  tractions 
and  displacements  along  the  boundary  are  required.  The  traction  T for  the 
ith  side  of  an  element  are  given  in  terms  of  the  stresses  by  ® 


^Vi  = * ^re^Vi 


where 


(vpi  = 


(Vi  = 


r.  r. 
1+1  1 


z.  - z.  . 
1 1+1 


^i  = /(’^i+1  - ’^i)  " (^i+r  ^i^ 

Substituting  Eq.  (25)  into  Eq.  (38),  one  obtains 

(Tg).  = [-3z(vj).  + r(v2).]e2  + (v^-S^  + r(v2)i65 
From  Eqs.  (4),  (27)  and  (40),  one  obtains 

L=  (V 

R=  [-3zvj  + rv2,  v^,  1^2] 

The  boundary  displacement  are  related  to  the  nodal  displacements,  q,  as 
u = L q 


with 


iqi*  q2.  q3*  q4J  = ivj,  vy  v^j  (43) 

Linear  boundary  displacements  are  assumed  as  given  by  Eqs.  (17),  By  defining 

0 

(1-s) 

2 

(44) 

0 

(1+s) 

2 

The  matrix  for  each  side  may  be  readily  written  as: 

Side  1 L = [L,  i L.  i 0 I 0] 

^ ^ 

Side  2 = [0  i ; L2  i 0]  (45) 

Side  3 L = [0  i 0 : L,  : L-] 

Side  4 L = [L-  : 0 : 0 : LJ 

The  contribution  to  the  matrix  ^ from  each  side  is  now  given  by 

£=  2tt  ^(S)  (S)  r(S)  |j|  dS  (46) 

The  Jacobian,  | j| , is  simply  half  the  length  of  the  side  and  the  total  value 
of  ^ is  obtained  by  summing  the  contributions  of  Eq.  (46)  from  each  side.  A 
Gaussian  integration  rule  is  used  for  the  evaluation  of  Eq.  (46). 

When  the  matrices  ]1  and  £ have  been  formed,  and  has  been  inverted 
(note  that  is  symmetric  and  positive  definite) , the  element  stiffness  matrix, 

J^,  is  determined  by 

k = G^H'^G  (47) 

IV.  NUMERICAL  EXAMPLES.  In  order  to  examine  the  adequacy  of  the  element 


developed  in  this  study,  it  is  employed  to  analyze  two  problems  with  known 


analytical  solutions.  The  first  problem  is  the  torsion  of  a solid  cylinder 
(Fig.  3)  and  the  second  one  is  the  torsion  of  a truncated  cone  (Fig.  4). 

The  exact  solution  [6]  of  each  problem  is  given  here  along  with  the  figure. 


Torsion  of  Solid  Cylinder 


Fig.  4 

Torsion  of  Truncated  Cone 
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results  of  the  finite  element  solution  of  these  problems  con^ared 
exact  solutions  are  shown  in  Tables  1 and  2.  Study  of  the  convergence 

of  the  finite  element  solutions  versus  the  total  number  of  elements  used  is 
also  indicated  in  the  tables.  As  it  can  be  seen  that  an  excellent  comparison 
is  obtained  between  the  finite  element  solutions  and  the  exact  solutions  in 
the  solid  cylinder  case  for  the  various  mesh  sizes  employed.  In  the  case  of 
truncated  cone,  excellent  agreement  is  achieved  for  the  stress  in  the 
larger  mesh  size  case. 

It  should  be  added  here  that  other  forms  of  stress  assumptions  were  also 
studied  including  second  and  third  order  terms.  However,  less  than  satis- 
factory results  were  obtained. 
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TABLE  I 

Comparison  of  Finite  Element  Solution  and  Exact  Solution  (Solid  Cylinder) 


arisen  of  Finite  Element  Solution  and  Exact  Solution  (Truncated  Cone) 


10.5  I 2.813  I 141.27  I 141.91  I 139.62  I 17.66  ' I 16.33  I 13.17 
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ABSTRACT . The  line  method  of  analysis  is  applied  to  the  Navler- 
Cauchy  equations  of  elastic  equilibrium  to  calculate  the  displacement  fields 
in  finite  geometry  bars  containing  central,  surface,  and  double-edge  cracks 
under  extenslonally  applied  unifom  loading.  The  application  of  this  method 
to  these  equations  leads  to  coupled  sets  of  simultaneous  ordinary  differen- 
tial equations  whose  solutions  are  obtained  along  sets  of  lines  in  a dis- 
cretized region.  Normal  stresses  and  the  stress  intensity  factor  variation 
along  the  crack  periphery  are  calculated  using  the  obtained  displacement 
field.  The  reported  results  demonstrate  the  usefulness  of  this  method  in 
calculating  stress  intensity  factors  for  commonly  encountered  crack  geom- 
etries in  finite  solids. 

INTRODUCTION.  The  main  goal  of  fracture  mechanics  is  the  prediction 
of  the  load  at  which  a structure  weakened  by  a crack  will  fall.  Knowledge 
of  the  stress  and  displacement  distributions  near  the  crack  tip  is  of  funda- 
mental importance  in  evaluating  this  load  at  failure.  During  the  early 
development  of  crack  mechanics  most  of  the  effort  was  focused  on  through- 
thickness cracks  which  could  be  characterized  as  two-dimensional.  However, 
part-through  cracks  are  the  most  common  type  of  crack  defect  found  in  actual 
service  conditions  (ref.  1). 

Because  of  the  geometric  singularity  associated  with  any  crack  type 
problem,  only  limited  analytical  work  has  been  done  in  the  past  on  these 
problems.  Early  theoretical  solutions  for  three-dimensional  flaw  config- 
urations usually  involved  the  discussion  of  cracks  in  infinite  or  semi- 
infinite  solids  (refs.  2 to  8) . For  this  reason,  results  for  finite  geom- 
etry stress  intensity  factors  are  usually  given  in  terms  of  magnification 
factors  applied  to  some  convenient  reference  solution.  In  addition,  con- 
siderable scatter  exists  in  the  reported  results  as  obtained  by  different 
investigators  (ref.  9).  In  our  work  these  difficulties  are  avoided  by 
solving  the  finite  dimensional  problems  directly. 

Recently,  approximate  solutions  of  the  finite  geometry  surface  crack 
problem  were  obtained  by  the  boundary  integral  equation  method  (ref.  10) 
and  the  finite  element  method  (ref.  11).  An  alternate  semi- analytical 
method  suitable  for  the  elastic  solution  of  crack  problems  is  the  line 
method  of  analysis.  Successful  application  of  this  method  to  finite 
geometry  solids  containing  cracks  has  been  demonstrated  by  Gyekenyesl 
and  Mendelson  (ref.  12).  Although  the  concept  of  the  line  method  for 
solving  partial  differential  equations  is  not  new  (ref.  13),  its  appli- 
cation in  Che  past  has  been  limited  to  simple  examples.  The  basis  of 
this  technique  is  the  substitution  of  finite  differences  for  the  deriva- 
tives with  respect  to  all  the  independent  variables  except  one  for  which 

*This  article  has  been  issued  a NASA  Technical  Memorandum  No.  73717. 
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the  derivatives ,are  retained.  This  approach  replaces  a given  partial  dif- 
ferential equation  with  a system  of  simultaneous  ordinary  differential  equa- 
tions whose  solutions  can  then  be  obtained  In  closed  form.  These  equations 
describe  the  dependent  variable  along  lines  which  are  parallel  to  the  co- 
ordinate In  whose  direction  the  derivatives  were  retained.  Application  of 
the  line  method  Is  most  useful  when  the  resulting  ordinary  differential 
equations  are  linear  and  have  constant  coefficients. 

An  Inherent  advantage  of  the  line  method  over  other  numerical  methods 
Is  that  good  results  are  obtained  from  the  use  of  relatively  coarse  grids. 

This  use  of  a coarse  grid  Is  permissible  because. parts  of  the  solutions 
are  obtained  In  terms  of  contlnuotis  functions.  Additional  accuracy  In 
normal  stress  distributions  Is  derived  from  the  fact  that  they  are  ex- 
pressed as  first-order  derivatives  of  the  displacements  and  these  deriv- 
atives can  be  analytically  evaluated.  Inherently  Inaccurate  numerical 

differentiation  Is  required  only  for  evaluating  the  shear  stresses,  but  ] 

this  presents  no  Important  loss  of  accuracy  since  they  are  an  order  of 

magnitude  smaller  than  the  normal  stresses.  For  problems  with  geometric 

singularities,  additional  accuracy  Is  derived  from  using  a displacement 

formulation  since  the  resulting  deformations  are  not  singular. 

It  Is  the  purpose  of  this  report  to  present  a simple  and  systematic 
approach  to  the  elastic  analysis  of  three-dimensional,  finite  geometry  j 

solids  containing  traction-free  cracks.  The  need  for  these  specific  solu-  \ 

tlons  has  existed  for  a number  of  years  In  fracture  toughness  testing.  j 

j 

REDUCTION  OF  THE  NAVIER-CAUCHY  EQUATIONS  TO  SYSTEMS  OF  ORDINARY  ’ 

DIFFERENTIAL  EQUATIONS.  Within  the  framework  of  linearized  elasticity  I 

theory,  the  equations  of  elastic  equilibrium  In  terms  of  displacements 
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For  a finlt-:  geoaetry  solid  with  rectangular  boundaries,  we  construct 
three  sets  of  parallel  lines  (fig.  1(a)).  Each  set  of  lines  is  parallel 
to  one  of  the  coordinate  axes  and  thus  perpendicular  to  the  corresponding 
coordinate  plane.  An  approximate  solution  of  equation  (1)  can  be  ob- 
tained by  developing  solutions  of  ordinary  differential  equations  along 
the  x-directional  lines.  As  seen  in  the  figure,  ther  are  a total  of 
i =•  NY  X NZ  such  lines  where  NY  is  the  number  of  lines  along  the  y- 
direction  and  NZ  is  the  number  of  lines  along  the  z-directlon  in  a given 
plane,  respectively.  We  define  the  displaoenents  along  these  lines  as 
ui,  U2,  . . . , u^ . The  derivatives  of  the  y-directicnal  displacements 
on  these  lines  with  respect  to  y are  defined  as  v'j2,  . . .,  v'|^, 

and  the  derivatives  of  the  z-cirectional  displacements  with  respect  to 
z are  defined  as  w'j]_,  w'|2 These  displacements  and  deriva- 

tives can  then  be  regarded  as  functions  of  x only  since  they  are  vari- 
ables on  x-direccional  liitas.  '^hen  these  definitions  are  used,  the 
ordinary  differential  equation  along  a generic  line  ij  (a  double  sub- 
script is  used  here  for  simplicity  of  writing)  in  figure  1(h)  may  be 
written  as 
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dy  Iz 

Similar  differential  equations  are  obtained  along  the  other  x-directional 
lines.  Since  each  equation  has  the  terms  of  the  displacements  on  the  sur- 
rounding lines,  these  equations  constitute  a system  of  ordinary  differential 
equations  for  the  displacements  Uj^,  U2,  . . .,  u^. 

The  set  of  \ second  order  differential  equations  represented  by  (5) 
can  be  reduced  to  a set  of  2i  first  order  differential  equations  by  treating 
the  derivatives  of  the  u's  as  an  additional  set  of  i unknowns,  i.e., 
defining 


matrix  differential  equation 

^ + R(x)  (8) 

where  U and  R are  column  matrices  of  2i  elements  each  and  is  a 
2i  X 2\  matrix  of  the  constant  coefficients  appearing  In  equations  (5)  and 
(7). 

In  a similar  manner,  to  solve  equations  (2)  and  (3) , ordinary  dif- 
ferential equations  are  constructed  along  the  y-  and  z-dlrectlonal  lines 
respectively.  These  equations  are  also  expressed  In  an  analogous  form 
to  equations  (8);  they  are 

^-A2V+S(y)  (9) 


^ - A3W  + T(z)  (10) 


I 
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Equations  (8)  to  (10)  are  linear  first-order  ordinary  matrix  dif- 
ferential equations.  They  are,  however,  not  Independent,  but  are  coupled 
through  the  vectors,  R,  S and  T whose  components  are  given  by  equations 
similar  to  (6).  The  elements  of  the  coefficient  matrices  A^,  A2,  and 
A3  are  all  constants,  being  functions  of  the  mesh  spacing  and  Poisson's 
ratio  only. 

Noting  that  a second-order  differential  equation  can  satisfy  only 
a total  of  two  boundary  conditions  and  since  three-dimensional  elasticity 
problems  have  three  boundary  conditions  at  every  point  of  the  bounding 
surface,  some  of  the  boundary  data  must  be  Incorporated  Into  the  surface 
line  differential  equations.  Hence,  conditions  of  normal  stress  and  dis- 
placement are  enforced  through  the  constants  of  the  homogeneous  solutions 
while  shear  stress  boundary  data  must  be  Incorporated  Into  the  differential 
equations  of  the  surface  lines.  The  application  of  the  specified  shear 
conditions  permits  the  use  of  central  difference  approximations  when  sur- 
face line  differential  equations  are  constructed.  The  details  of  con- 
structing these  equations  are  found  In  reference  lA. 

SOLUTION  OF  THE  SYSTEMS  OF  ORDINARY  DIFFERENTIAL  EQUATIONS.  The 
systems  of  ordinary  differential  equations  (8)  to  (10)  can  be  solved 
by  any  of  a number  of  standard  techniques.  The  method  used  herein  was 
basically  the  matrlzant  or  Peano-Baker  method  of  Integration  (ref.  15). 

For  equation  (8)  the  solution  can  be  written  as 
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with  similar  solutions  for  equations  and  (10).  U(0)  is  the  initial 

value  vector,  determined  from  the  boundary  conditions.  The  conversion  of 
given  boundary  data  into  required  initial  values  is  discussed  in  more 
detail  in  reference  14. 

A X 

The  matrlzant  el  is  generally  evaluated  by  its  matrix  series. 

For  larger  values  of  x,  when  convergence  becomes  slow,  additive  formulas 
may  be  used.  In  addition,  similarity  transformations  can  be  used  to  diag- 
onalize the  matrix  A]^.  These  various  techniques  for  improving  the  ac- 
curacy are  discussed  in  detail  in  reference  14. 

Since  equations  (8)  to  (10)  and  their  boundary  conditions  are  highly 
coupled,  it  is  generally  impossible  to  directly  evaluate  their  solutions. 
Thus,  a successive  approximation  procedure  must  be  employed  where  assumed 
values  must  be  used  initially  for  the  required  unknowns.  The  cyclic 
resubstltutlon  of  the  obtained  solutions  into  the  coupling  vectors  and 
the  boundary  conditions  will  usually  converge  to  the  correct  solution, 
depending  mainly  on  the  accuracy  to  which  the  required  matrlzant  can  be 
evaluated. 


Once  the  successive  approximation  procedure  has  converged  and  the 
displacement  field  in  the  body  has  been  calculated,  the  normal  stress 
distributions  can  be  obtained  directly  by  using  the  stress-displacement 
equations.  The  shear  stresses,  however,  can  be  evaluated  only  through 
finite  difference  approximations  for  the  required  displacement  gradients. 

STRESS  INTENSITY  FACTOR.  The  stress  intensity  factor  Kj  was  at 
first  obtained  from  the  calculated  stresses  and  displacements  by  extending 
the  usual  definition 

^1  ‘ <^/2rR)"  (12) 


to  discrete  data,  where  R is  measured  from  and  is  normal  to  the  crack 
front  and  n is  the  singularity.  It  was  found,  however,  that  due  to  the 
coarseness  of  the  grid  used,  the  usual  plotting  and  extrapolating  techniques 
gave  results  that  were  erratic  and  of  questionable  accuracy.  This  was 
compounded  by  the  fact  that  the  precise  crack  tip  location  is  not  really 


known  except  that  it  is  approximately  midway  between  two  lines,  one  of 
which  has  zero  displacement  specified  in  the  crack  plane  and  one  of  which 
has  zero  stress  specified.  It  was  fotond,  however,  that  by  using  two  terms 
in  the  stress  and  displacement  series  expansions  around  the  crack  tip, 
good  results  could  be  obtained  even  with  the  coarse  grid  used.  Furthermore, 


this  also  permitted  us  to  determine  the  actual  crack  tip  location  from  the 
computed  results.  The  method  utilized  is  as  follows.  We  take 


where  a is  a function  of  Poisson's  ratio,  n was  assumed  to  be  -1/2 
and  r is  the  crack  edge  position  correction  measured  from  the  originally 
assumed  midpoint  position.  Using  displacement  data  from  three  adjacent 
nodes  to  the  crack  edge  in  equation  (13),  values  of  a K^,  Lj/K^,  and  r 
are  calculated  for  each  value  of  z,  with  R also  measured  from  the 
half-way  point  between  nodes  specifying  boundary  stresses  and  displacements, 
respectively.  Substituting  values  of  Lj/Kj  and  r into  equation  (14) , 
we  can  calculate  Kx  as  a function  of  the  corrected  crack  edge  distance, 

P R - r.  A plot  of  In  Ki  versus  as  -*■  0 can  then  be  used  to 

obtain  Kj.  In  a similar  manner,  a can  now  be  calculated  from  equation  (13), 
where  the  corrected  crack  edge  distance  with  the  displacement  data  is 
p “ R + r. 

APPLICATION  TO  TENSILE  FRACTURE  SPECIMENS  CONTAINING  CRACKS.  A great 
amount  of  experimental  work  has  been  done  in  fracture  mechanics  (ref.  16) 
through  the  use  of  crack-notched  specimens.  In  the  past,  many  different 
types  of  specimens  have  been  used  to  determine  a material's  fracture  toughness. 
The  most  common  early  specimens  employed  in  these  tests  were  the  center- 
cracked  and  double-edge-notched  bar  specimens.  Figures  2(a)  and  3(a)  show 
the  finite  rectangular  bars  with  through- thickness,  traction-free  central 
and  double-edge  cracks,  respectively.  Because  of  the  symmetric  geometry  and 
loading,  only  one-eighth  of  each  bar  has  to  be  discretized  as  shown  in 
figures  2(b)  and  3(b). 


NUMERICAL  RESULTS.  - Center-Cracked  Tensile  Fracture  Specimen.  The 
solution  of  this  problem  was  obtained  by  using  two  different  sets  of  lines 
along  the  coordinate  axes  so  that  the  convergence  of  the  finite  difference 
approximations  could  be  checked.  In  a given  direction,  uniform  line  spacing 
was  used  in  all  computations  with  no  other  restriction  being  placed  on  the 
selection  of  the  grid  size.  The  crack  edge  location  with  respect  to  the 
imposed  grid  was  initially  assumed  to  be  halfway  between  nodes  specifying 
normal  stress  and  displacement  boundary  conditions,  respectively.  Sub- 
sequently, using  the  obtained  near  crack  tip  stresses  and  displacements, 
a more  accurate  crack  edge  location  was  established  for  calculating  the 
stress  intensity  factor.  This  approach  was  considered  acceptable  since 


r 
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the  results  from  the  two  sets  of  lines  at  corresponding  points  did  not 
change,  although  the  crack  edge  to  node  distance  was  considerably  decreased 
for  the  finer  mesh.  The  successive  approximation  procedure  required  for 
decoupling  the  three  sets  of  ordinary  differential  equations  was  terminated 
when  the  difference  between  successively  calculated  nondlmenslonallzed 
displacements,  which  are  of  the  order  of  unity,  at  every  point  was  less 
than  a present  value  (10“^) . As  expected,  the  convergence  rate  of  this 


successive  epprozlmatlon  proceduu  was  greatly  dependent  on  the  Initial 
guess  for  the  required  unknowns  la  the  coupling  vectors  and  boundary  condi- 
tions. For  marlaun  computer  efficiency,  displacement  data  obtained  from  the 
use  of  coarse  grids  was  Interpolated  to  obtain  Improved  starting  values 
for  the  computations  Involving  the  final  spacing  of  lines.  The  required 
Initial  quantities  for  the  preliminary  coarse  grid  calcualtlons  were  taken  to 
be  zero  In  our  work.  All  calculations  were  performed  on  a UNIVAC  1100/40 
computer,  using  double  precision  arithmetic. 

For  the  selected  geometry,  the  crack  opening  displacements  and  normal 
stresses  from  our  analysis  and  those  from  Raju's  finite  element  method 
(ref.  17)  are  compared  In  Table  I.  Although  In  our  solution  of  the  gross 
displacement  and  stress  fields,  the  minimum  corrected  crack  edge  distance 
Is  p - 0.042c,  crack  opening  displacements  and  stresses  can  be  calculated 
from  equations  (13)  and  (14)  at  any  value  of  p for  which  these  equations 
are  assumed  to  be  accurate  (p  ~ 0.40c  or  less  In  this  problem).  As  seen 
from  Table  I,  there  Is  good  agreement  In  most  displacements,  with  the  normal 
stress  at  the  surface  showing  the  greatest  difference. 

The  dimensionless  crack  opening  displacement  Is  shown  In  figure  4. 
Agreement  with  the  finite  element  results  Is  seen  to  be  very  good.  It  Is 
noteworthy  that  the  results  correspond  to  elliptical  crack  profiles  In  all 
cases . 

An  Indication  of  the  accuracy  of  our  technique  for  computing  the  stress 
Intensity  factor  Is  seen  In  figure  5.  This  figure  shows  the  stress  Intensity 
factor  variation  across  the  bar  thickness.  The  results  obtained  In  ref- 
erence 17  using  a finite  elements  method  for  a geometry  almost  Identical 
to  one  of  the  geometries  In  this  paper  Is  also  shown.  It  Is  seen  that  very 
good  agreement  Is  obtained  between  two  completely  different  methods.  In 
addition,  Islda's  plane  solution  (ref.  18),  corrected  for  finite  width  and 
length.  Is  also  shown  In  this  figure  for  comparison.  Note  that  these  results 
Indicate  a small  Increas  In  Kj  at  the  surface  with  Increasing  bar  thickness. 
Interestingly,  for  bars  with  t > 3c,  Increases  gradually  with  z, 
reaching  a maximum  near  z > 0.85t,  and  then  decreases  rapidly  to  Its  surface 
value. 


Surface  Crack  Tensile  Fracture  Specimen.  Figure  6 shows  a finite  geom- 
etry bar  containing  a traction  free  rectangular  surface  crack.  Because  of 
the  synnetrlc  geometry  and  loading,  only  one-fourth  of  the  bar  has  to  be 
discretized  as  shown  In  figure  6(b). 

Selected  results  of  the  dimensionless  surface  crack  opening  displacements 
are  shown  In  figure  7.  Note  that  the  crack  opening  Increases  rapidly  with 
crack  depth  for  0.21  £ a/t  £ 0.87,  slightly  exceeding  even  the  surface 
crack  displacement  of  a through-thickness  crack  at  a/t  ■ 0.87.  The  plane 
strain  solution  for  a finite  width  center  cracked  bar  Is  also  shown  In  figure  7 
for  reference.  Final  displacement  values  In  this  report  were  obtained  from 
a set  of  100,  140,  and  140  x-,  y-,  and  z-dlrectlonal  differential  equations, 
respectively.  A typical  coiq>uter  run  for  this  system  of  equations  takes 
approximately  30  to  40  minutes  of  CPU  time  and  720  K bytes  of  storage. 
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In  order  to  show  the  singularity  of  the  stresses,  the  y-dlrectlonal 
normal  stress  In  the  crack  plane  Is  plotted  In  figure  8 for  a/t  « 0.536. 

The  results  clearly  Indicate  the  singular  nature  of  o along  the  crack 
periphery.  . ^ 

Double  Edge  Crack  Tensile  Fracture  Specimen.  Our  last  example  Is  the 
finite  bar  with  double  edge  cracks.  The  crack  opening  displacements  for  this 
problem  are  presented  In  figure  9.  The  stress  Intensity  factor  variation 
as  a function  of  bar  thickness  Is  shown  In  figure  10.  In  both,  results 
from  the  finite  element  method  are  shown  for  comparison.  Agreement  Is 
again  excellent. 

CONCLUSIONS.  The  line  method  of  analysis  presented  affords  a practical 
way  for  analysis  of  three-dimensional  crack  problems,  at  least  for  bodies 
with  reasonably  regular  boundaries.  Because  parts  of  the  solution  are  ob- 
tained as  continuous  functions  along  the  lines  chosen,  relatively  good 
accuracy  can  be  obtained  with  coarse  grids.  Results  of  the  analysis  Include 
the  displacements  and  normal  stresses  at  every  node  Inside  the  body  from 
which  the  stress  Intensity  factor  variations  were  easily  calculated.  In 
addition  It  should  be  noted  that  the  common  seml-elllptlcal  surface  crack 
problem  could  also  be  analyzed  by  merely  changing  the  boundary  conditions 
at  certain  nodes  In  the  crack  plane.  Introduction  of  plasticity  Into  the 
analysis  could  also  be  accomplished  by  changing  the  coupling  terms  In 
equations  (8)  to  (10).  Since  these  have  to  be  determined  by  an  Interatlve 
process  In  any  case.  It  would  seem  possible  to  solve  the  elastoplastlc  prob- 
lem by  a simple  extension  of  the  present  method.  Whether  this  approach  Is 
practical  requires  further  Investigation. 
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Figure  4.  - Crack  opening  displacement  for  center-cracked  bar  under 
uniform  tension. 
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Figure  5.  - Stress -Intensity  factor  variation  as  a function  of  bar  thick- 
ness for  a cenfer-crKked  rectangular  bar  under  uniform  tension. 


Figure  6.  - Bar  with  rectangular  surface  crack  under  uniform  tension. 
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Figure  8.  - Dimensionless  y-directlonal  normal  stress 
distribution  in  the  crack  plane  for  a bar  under  uniform 
tension  containing  a rectangular  surface  crack. 
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figure  9.  - Crack  opening  displacement  for  rectangular  bar  under 
uniform  tension  containing  through-thickness  double-edge 
cracks. 


Figure  10.  - Stress  intensity  factor  variation  as  function  of  bar  thick- 
ness for  rectangular  bar  under  uniform  tension  containing  double- 
edge cracks. 
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I ABSTRACT.  The  problem  considered  is  an  elastic-plastic,  annular 

T plate  radially  stressed  by  uniform  internal  pressure.  The  partially 

ji  plastic  deformation  problem  is  extended  to  the  fully  plastic  case.  The 

I plate  is  elastically  as  well  as  plastically  orthotropic  but  isotropic 

]l  in  its  plane.  The  exact  solution  is  obtained  on  the  basis  of  the  J2 

1 deformation  theory,  the  Hill's  yield  criterion  and  a modified  Ramberg- 

Osgood  Law.  Numerical  results  for  the  stresses,  strains  and  displace- 


ment will  be  presented  and  discussed. 

1.  INTRODUCTION.  The  problem  considered  is  an  elastic-plastic 
annular  plate  radially  stressed  by  uniform  internal  pressure.  For 
ideally  plastic  materials,  the  stress  solution  for  this  statically 
determinate  problem  was  first  obtained  by  Mises  [1]  and  the  correspond- 
ing two  strain  solutions  were  obtained  by  the  present  author  on  the 
basis  of  both  J2  deformation  and  flow  theories  [2].  For  elastic-plastic 
strain-hardening  materials,  an  exact  solution  for  the  partially  plastic 
deformation  problem  was  recently  reported  in  [3]  for  the  isotropic  case 
and  in  [4]  for  the  anisotropic  case.  Analytical  expressions  were 
derived  but  only  the  effect  of  geometric  ratio  on  the  stresses  was 
briefly  discussed  in  that  technical  note. 

In  the  present  paper,  the  partially  plastic  deformation  problem  is 
extended  to  the  fully  plastic  case.  A unified  treatment  is  given  here 
for  both  cases.  The  plate  is  elastically  as  well  as  plastically 
orthotropic  but  isotropic  in  its  plane.  The  material  model  is  assumed 
to  obey  the  J2  deformation  theory,  the  Hill's  yield  criterion  and  a 
modified  Ramberg-Osgood  law  [5] . The  exact  solution  will  be  presented 
and  the  effect  of  strain  hardening  on  the  stresses,  strains  and  displace- 
ment for  the  complete  range  of  loading  will  be  discussed. 

2.  BASIC  EQUATIONS.  Assuming  small  strains  and  neglecting  inertia 
forces  in  the  axi symmetric  state  of  plane  stress,  the  radial  and 
tangential  stresses,  Oj.  and  09,  must  satisfy  the  equilibrium  equation. 


00  = (d/dr)(raj.)  ; (1) 


and  the  corresponding  strains,  Ej.  and  £0  , are  given  in  terms  of  the 
radial  displacement,  u , by 


Ej.  = du/dr  , Eg  = u/r  . 


(2) 


We  shall  assume  that  the  plate  is  elastically  as  well  as  plastically 
orthotropic  but  isotropic  in  its  plane,  E = Ej.  = Eq,  v = = v, 


rilateS^ 


-r  - - '"re  “ '"er* 

V,..  = V02  are  the  elastic  constants,  and  R is  the  plastic  strain  ratio 
to  the  yield  stress  ratio  oj,  plastic  Poisson's  ratio  y-  by 

“ R/(l+R)-  According  to  the  simple  deformation  theory, 
the  strains  are  related  to  the  stresses  by 


Ej.  = E'l(aj.-vag)  + (Es‘^-E"^)(aj.-ypO0)  , 

Eg  = E'^Cog-vaj.)  + (Eg'^-E"^)  (ag-ypOr)  , C3) 


where  Eg  is  the  secant  modulus  on  the  effective  stress-strain  curve 
with  Eg  = a/e  and 

1/2 

a = ca^.^  + og^  - 2ypaj.CTg)  . (4) 


If  a modified  uniaxial  relation  of  the  Ramberg-Osgood  type  is  assumed  [S] , 
we  have 

Eg"^  = E"^  for  a $ Oy  ; Eg'^  = E ^(cr/Oy)"  ^ for  a > Oy  (5) 

and  the  initial  yield  surface  is  defined  by  the  ellipse  0 = Oy 

Since  the  compressibility  of  the  material  is  taken  into  account,  the 
longitudinal  strain  e^  can  be  determined  by 

Ej.  + Eg  + E^  = E'^(l-V-Vj)  (Oj.  + 0q)  , (6) 

which  holds  in  the  elastic  as  well  as  plastic  region. 

The  boundary  conditions  on  the  problem  are 

aj.Ca,  t)  = - P , aj.(b,  t)  = 0 , (7) 

Where  a,  b and  P are  the  inner,  outer  radii  and  internal  pressure, 
respectively,  and  t denotes  some  monotonic  parameter  such  as  P or  the 
elastic-plastic  boundary  p.  In  addition,  all  stresses,  strains  and 
displacement  must  be  continuous  throughout  the  entire  region. 
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In  the  following,  the  solutions  will  be  presented  in  terms  of 
nondimensional  quantities  defined  by 

a = a/b,  5 = r/b,  3 = P/b,  X = P/Oy  , 

Sj.  = aj./ay,  Sq  = Og/Oy,  S = cr/Oy  , 

” b€j»/cjy,  eg  = EGg/Oy,  e^  “ 

3.  SOLUTION  IN  THE  PLASTIC  REGION  fa  < g < g) • Following  Nadai 
for  isotropic  problems  [6],  we  introduce  the  parametric  representation 
(0  < 4)  < tt/2) 


Sj.  = -S  cos(j>/sin26  , 

Sg  = -S  cos(4)  + 26)/sin26  , (9) 

which  satisfies  equation  (4)  identically  and  leads  to  the  following 
equation  upon  substituting  into  the  equation  of  equilibrium, 

C"^d5  = [sin26(tan6  + tan(t>) ] "^  (tan<|)d4)  - S"^dS)  . (10) 

By  the  extended  Michell  theorem  [7],  the  stress  solution  for  the  present 
problem  is  independent  of  v.  So  choose  v = Up  in  (3)  a..d  make  use  of  (2), 
(5),  (8)  and  (10),  we  have 

5 = [-sin26(cot6  + cot(}i)]  ^(cot(|)d(|)  + nS'^dS)  . (11) 

The  above  two  ordinary  differential  equations  form  a system  and  can  be 
integrated  exactly  if  we  know  the  boundary  values  for  S and  at  ^ = a 
or  3.  These  values  at  the  two  boundaries  of  the  plastic  region  will  be 
determined  in  the  next  section.  Since  S and  (|)  are  functions  of  C and 
3.  the  notation  S^g  = S(5,3),  (|i^g  = <|>{e,3)  are  introduced.  After  some 
manipulation,  the  results  are  presented  in  the  following  form: 


where 


^53/^33  = 4>Bg)  . 

(3/a*  = F((t)pR,  <t)gg)  , 


n sin(|)gg  - cot6  cos<(igg 
n sin(|)^g  - cotd  cos(t)^g 


-rw 


exp 


(n-l)cot6 


n^+cot*fi 


(12) 


(13) 
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ncsc^($ 

n^+cot^6 


sinC(|)^g+6) 

sin((t)gg+6) 


n sin<t)gg 
n sin*^g 


- cot6  cos(j)gg 

- cot6  cos(J)^g 


X exp 


(n^-l]cot6 

n^+cot^6 


(14) 


V = (n  + cot^6)/(n^  + cot^6)  , 


and  6 is  the  anisotropic  parameter  defined  in  the  first  quadrant  by 


tan^a  = (1-Pp)/(1  + Up)  = 1/(1  + 2R)  = l/(4a)2-l)  , (15) 

The  solution  for  the  strains  in  the  plastic  region  (aj<5^B)  of  an 
elastic-plastic  plate  with  finite  n can  be  obtained  from  (3)  and  (6), 
using  (5),  (8)  and  the  above  stress  solution.  After  some  manipulation, 
the  equations  for  the  dimensionless  strains  can  be  written  as 


e_  = -Spo  sin((t)^g+26)  -S^gCos((fi^g+26)  (cos26-v)/sin26  , 


®0  ~ ^50  sincjj^g  - cos(})^g(cos26-v)/sin26  . 

= [(2sin6)Srg  - (2sin6-V2^/sin6)Srg]  cos(<tipg+6) 


(16) 


4,  DETERMINATION  OF  THE  BOUNDARY  VALUES.  For  small  internal 
pressure  (X  ^ X*),  the  plate  will  be  elastic  throughout  (a  ^ ^ ^ 1)  and 
the  solution  is  omitted  here.  The  critical  value  X*  to  cause  incipient 
plastic  deformation  is 

-1/2 

X*  = (1-a^)  [2  (1  + pp)  + 2 (1  - Pp)a-]  . (17) 

For  values  of  p larger  than  p*,  the  plate  becomes  plastic  in  the  inner 
region. 


The  foregoing  solution  in  the  plastic  region  (a  5 £ B)  has  been 

derived  on  the  assumption  that  we  know  the  boundary  values  for  S and  ()) 

at  5 = B.  These  values  at  the  two  boundaries  at  ^ = a and  0 will  be 

determined  in  this  section  for  the  partially-plastic  as  well  as  fully- 

plastic  case. 


A.  Partially-Plastic  Deformation  (X*  ^ X £ X**) . In  this  case,  the 
plate  will  be  plastic  in  the^Hflher  region  (a  £ 5 £ B)  and  still  elastic  in 
the  outer  region  (B  £ ? £ 1) . In  the  outer  elastic  region,  the  equations 
for  the  dimensionless  stresses  and  strains  are 


(1  + 5 ^)/(sin^6  + B”"*  cos^6)^^^ 


■j  [(1  - v)  + (I  + v)5  ^]/(sin^6  + B ^ cos^6)^^^ 


e^  = - Vj/(sin^6  + B ^ cos^6)  . (18) 

The  above  equations  together  with  the  continuity  requirement  of  the  stresses, 
strains  and  displacement  at  the  elastic-plastic  interface  lead  to 

S^B  = 1 and  , (19a) 


+ an(()oo  = (B^tand  + cot6)/(l-B^) 


At  the  inside  surface,  ^ = a, 
condition  (7)  reduce  to 


, equations  (12)  with  the  boundary 


(B/a)"  = F((l)^g  , ())gg)  , (19d) 

and  ’ ‘*’BB^  ‘ 

The  five  equations  (19a  to  e)  are  sufficient  to  determine  the  five 
unknowns  B,  <t)gg,  Sgg,  0 g,  S g in  terms  of  X.  Alternatively,  we  can 
determine  X,  9ggj  in  terms  of  B.  The  latter  approach  has 

been  chosen  in  obtaining  numerical  results.  The  partially-plastic 
deformation  reaches  its  upper  limit  when  B increases  to  1.  The  correspond- 
ing pressure  factor  is  X**. 

B.  Fully-Plastic  Deformation  (X  ^ X**).  In  this  case,  the  plate 
will  be  plastic  in  the  entire  region  (a  ^ 5 £ B)  and  B = 1.  Equations  (9) 
and  (12)  with  the  boundary  conditions  (7)  reduce  to 

= tt/2  , (20a) 

X = <J)(ji/sin26  , (20b) 


region  can  be  calculated  as  shown  in  the  preceding  section. 

5.  DISCUSSIONS  OF  RESULTS.  Since  the  deformation  theory  is  used, 
the  validity  of  the  above  solution  should  be  assessed  by  applying 
Budiansky's  criterion  [8]  which  requires  the  following  inequality  to  be 
satisfied, 

[ (nS^g^-l)/(S^g^-l)]  tan(j)^g  tan6-l)/(tan())^g  + tan6) . (21) 

It  has  been  verified  numerically  that  all  the  values  of  and  (J)^g 
reported  here  satisfy  the  inequality.  We  may  discuss  the  effects^of 
elastic  constants  (E,  v,  v,),  plastic  material  constants  (a  , n,  R)  and 
geometric  ratio  (a/b)  on  the  stresses,  strains  and  displacement  for 
various  values  of  pressure  factor  (P/Oy) . Some  typical  results  for  the 
distribution  of  stresses  and  strains  in  the  2219-T87  aluminum  plate  with 
E = 10.5  X 10®  psi,  0 = 0]^=  .3,  Oy  = 5.5  x 10“  psi,  n = 9,  R = 1,  b/a  = 3 
were  presented  in  [3] . The  effects  of  geometric  ratio  on  the  stresses  at 
the  inside  surface  of  several  partly-plastic  plates  with  v = Vj  = .3, 
n = 9,  R = 1 and  b/a  = 2,3,4,10  were  reported  in  [4].  Neither  the 
numerical  results  for  the  strains  and  displacement  nor  the  effect  of  strain 
hardening  and  anisotropy  have  been  discussed.  In  the  following  additional 
results  will  be  given. 

The  emphasis  herein  is  on  the  effect  of  strain  hardening  in  a partly- 
plastic  as  well  as  fully-plastic  annular  plate  radially  stressed  under 
uniform  internal  pressure.  In  order  to  compare  with  the  elastic-perfectly- 
plastic  solution  given  in  [2],  we  choose  b/a  = 2,  v = = 0.3,  R = 1 and 

n = 3,9,15,  “>  . The  numerical  results  for  the  stresses,  strains  and 
displacement  are  presented  in  graphical  form  for  the  complete  range  of 
loading.  For  the  partially  plastic  case,  all  the  results  can  be 
expressed  as  functions  of  radius  of  elastic-plastic  boundary  p.  These 
are  shown  in  Figures  1-4.  The  effect  of  strain  hardening  (n)  on  the 
radial  stress  or  pressure  at  the  inside  surface  for  various  sizes  of 
plastic  zone  is  shown  in  Figure  1 and  the  corresponding  effect  on  the 
circumferential  stress  at  the  inside  surface,  in  Figure  2.  The  effect 
of  strain  hardening  on  the  stresses  is  quite  significant  as  shown  in 
the  above  figures.  The  differences  become  larger  as  the  size  of  plastic 
zone  expands.  It  is  interesting  to  point  out  that  the  stresses  at  the 
inside  surface  corresponding  to  initial  yielding  is  independent  of  strain 
hardening  but  depend  on  geometric  ratio  and  anisotropic  parameter  as 
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shovm  in  [4] • The  numerical  results  for  the  radial  and  axial  strains  at 
the  inside  surface  for  various  si^es  of  plastic  zone  are  presented  in 
Figure  3 and  the  corresponding  results  for  the  radial  displacement  or 
circumferential  strain  at  the  inside  surface,  in  Figure  4.  The  differences 
for  all  strain  components  become  larger  as  plastic  zone  expands,  however, 
those  differences  for  circumferential  strain  are  very  small.  In  Figure  4, 
we  also  present  the  results  for  the  residual  circumferential  stress  at 
the  inside  surface  as  a function  of  radius  of  elastic-plastic  boundary. 

The  results  were  obtained  by  assuming  elastic  unloading  when  the  pressure 
corresponding  to  any  size  of  plastic  zone  was  removed  completely.  As 
shown  in  Figure  4,  the  effect  of  increased  strain  hardening  (i.e.,  a 
smaller  value  of  n)  is  to  decrease  the  magnitude  of  the  compressive 
residual  circumferential  stress.  Since  the  smaller  value  of  this 
magnitude  is  less  favorable  and  larger  pressure  is  required  to  reach  it, 
we  conclude  from  this  point  of  view  that  the  strain  hardening  effect  is 
undesirable.  Other  considerations  such  as  reduced  ductility  and  reduced 
fatigue  resistance  also  support  this  conclusion.  The  best  candidate  is 
a high  strength  material  with  little  strain  hardening. 

For  the  fully  plastic  case,  the  results  are  no  longer  functions  of 
radius  of  elastic-plastic  boundary  p but  they  can  be  expressed  as  functions 
of  internal  pressure  or  radial  expansion  as  presented  in  Figures  5 to  8. 

In  all  these  figures,  the  results  for  the  complete  range  of  loading  are 
given  with  broken  lines  for  elastic  ranges,  solid  curves  for  partially- 
plastic  ranges  and  dotted  curves  for  fully  plastic  ranges.  The  radial 
and  circumferential  stresses  at  the  inside  surface  for  several  values  of 
n are  shown  in  Figure  5.  The  effect  of  strain  hardening  (n)  on  the 
stresses  is  quite  significant  as  shown  here.  For  large  stresses,  the 
curves  for  finite  n in  this  figure  approach  radially.  The  numerical 
results  for  the  radial  and  axial  strains  at  the  inside  surface  are 
presented  in  Figure  6 and  the  corresponding  results  for  the  radial 
displacement  or  circumferential  strain  at  the  inside  surface,  in  Figure  7. 

The  effect  of  strain  hardening  (n)  on  the  strains  and  displacement  can  be  ' 

seen  from  these  two  figures.  We  also  present  the  results  for  the 

residual  circumferential  stress  at  the  inside  surface  as  a function  of 

inside  radial  expansion  in  Figure  8.  Again,  the  assumption  of  elastic 

unloading  is  used  after  removing  the  pressure  completely.  It  should  also 

be  noted  that  the  present  solution  has  been  obtained  on  the  basis  of 

small  strain  assumption.  All  the  strain  results  reported  here  are  indeed 

small  since  they  must  be  multiplied  by  the  yield  strain,  Oy/E,  which  can 

take  on  a very  small  value.  However,  for  a given  material  with  a given 

yield  strain  the  validity  of  the  small  strain  approximation  may  be 

violated.  The  large  strain  approach  to  this  problem  should  be  examined. 


Ill 
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Fig.  5 Circumferential  stress  at  the  inner  boundary  as  function 


Fig.  6 Radial  and  axial  strains  at  the  inside  surface  as  functions  of  internal  pressure 
(b/a  = 2,  V = V,  = .3,  R = 1). 


as  functions  of  inside  radial  displacement  (b/a 


Fig.  8 Residual  circumferential  stress  at  the  inside  surface  as  functions  of  radial 
expansion  (b/a  = 2,  v = Vi  = .3,  R=n. 
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COMPUTER  SIMULATION  OF  SHOCK  PROPAGATION 
IN  THE  ONE-DIMENSIONAL  LATTICE* 


John  D.  Powell  and  Jad  H.  Batteh 
Physical  Sciences  Branch 
Ballistic  Modeling  Division 
US  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland 


ABSTRACT.  The  equations  of  motion  for  the  atoms  in  a one-dimensional 
lattice  subjected  to  steady  shock  compression  are  solved  numerically. 

The  atoms  are  assumed  to  interact  through  a nearest-neighbor,  Morse- 
type  potential.  The  effect  of  the  initial  state  of  the  lattice  upon 
the  shock  profile  is  studied  by  considering  two  sets  of  initial  con- 
ditions. In  the  first,  the  atoms  are  at  rest  in  their  equilibrium 
positions  prior  to  compression  by  the  shock  wave;  in  the  second,  the 
lattice  is  initially  in  thermal  equilibrium  at  approximately  room 
temperature.  The  lattice  is  found  to  support  the  propagation  of  well- 
defined,  stable  pulses  (solitons)  and  the  physical  implications  of 
these  pulses  are  discussed. 

I.  INTRODUCTION.  In  the  past  decade  or  so,  computer-molecular- 
dynamic  techniques  have  been  used  by  a number  of  different  investigators 
to  study  shock  propagation  in  discrete  crystal  lattices.  The  work  has 
been  motivated  by  a belief  that  the  usual  continuum  approximation  may, 

at  least  in  certain  cases,  fail  to  describe  the  effect  adequately.  These 
investigations  have,  in  our  opinion,  raised  but  failed  to  answer  some 
important  questions  regarding  the  existence  of  steady  state  and  the 
approach  to  thermal  equilibrium  behind  the  shock  front.  For  this  reason, 
and  because  of  the  importance  of  shock  propagation  to  detonation  theory, 
we  have  in  the  last  few  months  initiated  a program  in  computer  molecular 
dynamics.  As  an  initial  effort,  we  developed  in-house  a computer  code 
which  solves  the  atomic  equations  of  motion  for  a one-dimensional  lattice 
subjected  to  shock  compression.  The  results  for  the  one-dimensional  case 
are  of  some  interest  in  themselves,  but  this  case  really  represents  only 
an  initial  effort  in  the  development  of  a full  three-dimensional  code. 

We  now  describe  the  results  of  the  calculations,  emphasizing  particularly 
how  and  why  they  differ  from  the  continuum  results. 

II.  MODEL  AND  EQUATIONS.  The  model  employed  is  shown  in  Fig.  1. 

It  consists  of  a one-dimensional,  monatomic,  chain  of  N atoms,  each 
having  mass  m,  which  interact  through  some  interatomic  potential.  At 
time  T=0,  the  first  particle  is  subjected  to  steady  compression  at  a 
nondimensional  velocity  of  unity  - it  is  unity  because  we  normalize  all 
the  velocities  in  the  calculation  to  the  compression  velocity  - and  this 
compression  produces  a shock  wave  which  propagates  through  the  crystal. 


• For  more  detailed  discussion,  see  John  D.  Powell  and  Jad  H.  Batteh, 
"Shock  Propagation  in  the  One-Dimensional  Lattice",  BRL  Report  (to  be 
published) . 
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considered  several  forms  of  interatomic  potential  but,  for  purposes  of 
discussion,  will  consider  only  the  Morse  interaction  here. 

The  differential  equation  of  motion  for  the  particle  in  the 
lattice  is  just  given  by  Newton's  second  law,  namely. 


= I exp  [-2A(S^-S._j)]  - exp  [ -A(S  . -S  . _p] 
- exp  [-2A(S.^^-S.)]  + exp  [ -A(S.^^-S.)]  | 


• th 


In  this  expression,  S.  is  the  nondimensional  displacement  of  the  j 
particle,  T the  nondiAensional  time,  and  A a parameter  which  represents 
the  nonlinearity  or  anharmonicity  of  the  lattice.  Hereafter,  it  will  be 
referred  to  simply  as  the  nonlinearity  parameter.  In  this  equation,  we 
assume  that  only  nearest-neighbor  interactions  are  important  - i.e., 
that  only  the  j -1  st  and  j + 1 st  particles  exert  an  appreciable  force 

on  the  j 


• th 


There  is,  of  course,  an  equation  such  as  this  for  every  particle 
(several  hundred  in  our  calculation),  and  the  equations  are  coupled 
because  the  force  exerted  on  a given  particle  depends  upon  the  position 
of  those  adjacent  to  it. 

III.  THE  INITIALLY  QUIESCENT  UTTICE.  In  our  initial  calculations, 
we  assumed  for  simplification  that  all  atoms  in  the  lattice  were  initially 
at  rest  in  their  equilibrium  positions  prior  to  being  excited  by  the  shock 
front.  The  results  of  the  calculations  can  be  understood  most  easily  by 
comparing  the  velocity-time  trajectories  for  several  particles  in  the 
lattice  and  determining  what  aspects  of  the  general  shock  profile  can  be 
understood  therefrom.  Results  are  for  the  case  A = 1.0. 

Therefore,  in  2,  we  consider  initially  the  velocity-time 

trajectory  of  the  25  " particle  in  the  lattice  subsequent  to  its 
excitation  by  the  shock.  In  all  graphs  we  have  arbitrarily  readjusted 
the  time  axis  so  that,  at  time  T=0,  the  particle  first  feels  the  effect 
of  the  shock.  Prior  to  that  time,  it  is  at  rest  in  its  equilibrium 
position.  Just  behind  the  front,  we  find  that  the  velocity  of  the 
particle  varies  along  rather  well-defined  pulses  whose  amplitude  decreases 
with  increasing  time.  Although  it  is  not  shown  in  the  graph,  the  ampli- 
tude of  the  velocity  oscillations  eventually  approaches  some  asymptotic 
value  which  increases  with  increasing  A.  If  we  view  a particular  particle, 
then,  these  pulses  appear  to  propagate  into  the  lattice  from  the  end  at  which 
compression  is  occurring  and  it  is  found  that  the  speed  with  which  they 
propagate  decreases  with  decreasing  amplitude.  Thus,  they  tend  to  |pread 
apart  as  they  propagate.  By  the  time  the  shock  has  reached  the  500^ 
particle  in  the  lattice,  shown  on  the  lower  half  of  the  figure,  we  find 
extremely  well-defined  pulses,  having  reached  an  amplitude  of  about  2, 


123 


in  the  vicinity  of  the  front.  The  pulses  are  more  distantly  spaced  in 
time  than  when  the  shock  front  was  at  the  25^  particle  owing  to  the 
spreading  effect  just  mentioned.  These  well-defined  pulses  have  been 
observed  to  propagate  in  other  nonlinear,  dispersive  media;  they  main- 
tain their  shapes  upon  collision  with  one  another  (demonstrated  in  the 
next  section)  and  are  referred  to  as  solitons.  Physically,  they  repre- 
sent a balance  between  the  nonlinearity  which  tends  to  steepen  the  pulse 
and  dispersion  which  tends  to  spread  it  out.  Dispersion  is  simply  a 
characteristic  property  of  discrete  lattices  in  which  the  higher-frequency 
components  in  the  pulse  propagate  more  slowly  than  the  lower- frequency 
components . 

The  conclusion  for  this  case,  then,  is  that,  unlike  as  is  assumed 
in  continuum  theories,  the  profile  is  not  steady  because  of  the  spreading 
effect  and  the  lattice  does  not  approach  thermal  equilibrium  at  a higher 
temperature  behind  the  shock  front  because  of  the  propagation  of  well- 
defined  pulses.  Essentially,  all  the  energy  deposited  into  the  lattice  by 
the  shock  wave  is  propagated  in  the  form  of  solitary  waves  and  no  energy 
remains  for  thermalization.  Results  similar  to  this  have  been  obtained 
by  Tasi*  for  the  case  of  a slightly  nonlinear,  cubic  interatomic  potential 
for  which  he  solved  the  equations  of  motion  using  an  elegant  perturbation 
technique . 

IV.  THE  LATTICE  AT  NONZERO  INITIAL  TEMPERATURE.  The  calculations 
of  the  preceding  section  are  somewhat  of  a compromise  to  physical  reality 
because,  as  was  stated  previously,  it  was  assumed  that  initially  each 
atom  was  at  rest  in  its  equilibrium  position.  It  was  therefore  desirable 
to  next  determine  the  effect  of  nonzero  ambient  temperature  upon  the 
propagation  of  the  solitary  waves.  For  this  reason,  we  modified  our 
code  to  account  for  a nonzero  initial  temperature.  Specifically,  we 
randomly  assigned,  according  to  a Maxwellian  distribution,  velocities 
to  all  the  particles  in  the  lattice.  The  initial  energy  of  the  lattice 
was  equivalent  to  that  in  a corresponding  lattice  in  thermal  equilibrium 
at  roughly  room  temperature.  The  lattice  was  then  allowed  to  oscillate 
freely  and  we  employed  several  checks  to  ensure  that  it  was  in  a state 
of  thermal  equilibrium.  We  then  subjected  this  lattice  to  shock  com- 
pression in  the  same  manner  as  before. 

In  Fig.  3 we  show  the  velocity-time  trajectory  of  the  145^*^  particle 
in  the  lattice  just  after  the  shock  front  has  reached  it.  Again  the 
shock  front  arrives  at  time  T=0  on  this  arbitrary  time  scale.  There  is 
some  suggestion  of  solitary  waves  propagating  in  the  lattice  but,  owing 
to  the  thermal  background,  the  shape  of  the  pulses  is  not  so  well  defined 
as  in  the  previous  case.  In  fact,  though  it  is  perhaps  not  evident  in 
this  figure,  in  some  cases  we  studied  the  shapes  of  the  pulses  were  so 
poorly  defined  that  we  were  unable  to  determine  whether  they  were  actually 
solitons  or  simply  large  perhaps  unstable  variations  in  the  thermal 
background.  In  an  effort  to  resolve  this  point,  we  redid  the  calculation 
and  instantaneously  stopped  the  compression  when  the  shock  wave  was  at 


* J.  Tasi,  .1.  Appl.  Phys.  43,  4016  (1972);  4569  (1973);  U, 

2245  (1973). 
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the^^40^  particle.  We  then  set  every  atom  in  the  lattice  beyond  the 
140^  at  rest  in  its  equilibrium  position  so  that  we  essentially  had 
the  shock-compressed  lattice  next  to  a cold  lattice.  Our  belief  was 
that  the  thermal  background  would  proceed  slowly  into  the  cold  lattice, 
whereas  the  high-amplitude  solitons  should  propagate  in  rapidly  and  we 
could  therefore  separate  the  two  effects. 


The  result  of  the  calculation  is  sl^wn  in  Fig.  4 in  which  is  plotted 
the  velocity-time  trajectory  of  the  ^^5^  particle  after  the  com- 
pression had  been  stopped  at  the  140  . Note  that  we  indeed  find  solitons, 
of  varying  amplitude  this  time  but,  near  the  front,  the  amplitudes  are 
higher  than  in  the  case  for  the  initially  quiescent  lattice.  If  we  wait 
a time  much  longer  than  is  shown  on  the  graph,  we  will  eventually  see 
the  thermal  background  from  the  hot  shock -compressed  lattice  propagating 
into  the  cold  lattice.  As  before,  we  found  that  the  pulses  propagate  at 
a rate  which  increases  with  increasing  amplitude. 

Since  the  solitons  have  velocities  that  vary  with  amplitude,  one 
can  easily  calculate  at  what  point  in  the  lattice  two  solitons  should 
be  coincid|j\t  and  interact.  In  Fig.  5 we  show  such  an  interaction. 

At  the  170  particle  the  solitons  are  still  separated  and  propagating 
to  the  left.  The  higher-amplitude,  faster-moving  pulse  is  b|Jiind  the 
slower.  By  the  time  the  solitons  have  propagated  to  the  185^,  particle, 
a nonlinear  interaction  is  occurring  and  finally,  at  the  200^  particle, 
the  pulses  have  separated  and  assumed  their  original  shapes.  The 
resultant  disturbance  in  the  region  of  interaction,  of  course,  is  not 
a simple  linear  superposition  of  separate  amplitudes  because  the  differ- 
ential equations  are  nonlinear.  Consequently,  we  see  that  the  solitons 
are  extremely  stable  entities  that  do  not  scatter  irreversibly  even  when 
they  collide  with  one  another. 


From  the  calculation,  we  can  conclude  that  owing  to  the  spreading 
effect  of  solitons  of  different  propagation  velocities,  the  profile 
is  again  nonsteady  in  time.  Despite  the  fact  that  solitons  propagate 
in  the  vicinity  of  the  front,  it  is  interesting  to  ask  if  any  thermal- 
lization  of  the  energy  deposited  by  the  shock  wave  occurs.  Unfortunately, 
at  this  point  we  have  no  complete  answer  to  this  question  but  the  evidence 
seems  to  suggest  that  it  does.  Specifically  we  have  calculated  the 
velocity  distribution  function  for  atoms  which  were  well  behind  the  shock 
front  at  several  different  times  and  found  that  it  is  approximately  Max- 
well-Boltzmann  and  remains  essentially  constant  in  time.  The  distribu- 
tion function  consistently  corresponds  to  a temperature  that  is  about 
three  times  the  ambient  temperature  of  the  lattice.  Our  tentative  specu- 
lation then  - and  it  is  only  speculation  at  this  point  - is  that  the 
perturbation  induced  by  the  initial  thermal  oscillations  will  prevent  the 
formation  of,  or  perhaps  lead  to  the  decay  of,  some  of  the  solitons  that 
might  otherwise  form  and  that  this  energy  eventually  becomes  thermal! zed. 
Additional  study  of  the  effect  will  be  necessary  before  firm  conclusions 
can  be  drawn. 
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SUMMARY  AND  CONCLUSIONS.  To  summarize,  we  have  found  that  in 
the  one-dimensional  case  the  propagation  of  solitons  tends  to  prevent 
the  rapid  establishment  of  thermal  equilibrium  behind  the  shock  front, 
and  the  spreading  effect  prevents  the  shock  profile  from  approaching  a 
steady  state  as  is  generally  assumed  in  continuum  theories  of  shock 
propagation.  Consequently,  the  transition  region  between  the  equili- 
brated region  of  the  crystal  ahead  of  the  front  and  that  behind  the 
front  grows  as  the  shock  propagates  into  the  crystal. 

In  the  near  future,  we  intend  to  extend  the  calculations  to  three 
dimensions  and  to  more  realistic,  perhaps  impure,  crystals  to  see  if 
similar  effects  persist  in  that  case.  If  so,  we  believe  that  these 
effects  may  be  significant  in  the  study  of  shock-induced  detonations. 

Most  current  theories  assume  that  the  transition  region  can  be  ignored 
and  that  the  only  effect  which  the  shock  has  is  to  raise  the  temperature, 
pressure,  and  density  of  the  crystal  to  values  higher  than  the  ambient 
values.  Chemical  reactions  then  occur  in  a thermally  equilibrated  part 
of  the  crystal.  It  would  appear,  however,  that  since  the  transition 
region  grows,  chemical  reactions  may  occur  in  a region  characterized  by 
extreme  nonequilibrium  and  it  is  important  to  assess  the  effects  of  this 
environment  upon  chemical  - reaction  rates.  It  is  unlikely,  for  instance, 
that  such  rates  can  be  characterized  by  an  Arrhenius  - type  relation 
whose  validity  is  dependent  upon  the  existence  of  thermal  equilibrium. 

We  hope  to  investigate  this  problem  in  greater  detail  once  the  calcu- 
lations have  been  extended  to  three  dimensions. 
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A PERTURBATION  EXPANSION  OF  THE  NAyiER-STOKES 
EQUATIONS  FOR  SHOCK  WAVES 

Jad  H.  Batteh  and  John  D.  Powell 
Physical  Sciences  Branch 
Ballistic  Modeling  Division 
US  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  MD  21005 

ABSTRtXCT ■ A perturbation  expansion  is  developed  for  the  steady- 
state  Navier-Stokes  equations  describing  one-dimensional  shock  propa- 
gation in  an  ideal  gas.  The  temperature  dependence  of  the  viscosity 
and  thermal  conductivity  is  accounted  for,  though  the  specific  heat 
and  Prandtl  number  are  assumed  constant.  It  is  shown  that  if  the  first 
n-I  terms  of  the  expansion  are  known,  the  solution  for  the  nth  term 
can  be  reduced  to  a quadrature.  The  expansion  is  evaluated  explicitly 
to  second  order  in  the  shock  strength.  A comparison  of  the  second- 
order  approximation  with  a special-case,  exact  solution  indicates  good 
agreement  even  for  rather  strong  shock  waves.  The  perturbation  solu- 
tion provides  a simple,  analytic  technique  for  determining  the  effect 
of  temperature-dependent  transport  coefficients  on  the  structure  of 
weak  shock  waves. 

I.  INTRODUCTION.  For  sufficiently  weak  shock  waves,  the 
variation  of  the  flow  variables  through  the  front  can  be  determined 
from  the  Navier-Stokes  equations.  In  general,  these  equations  are  too 
complicated  to  solve  exactly  and  one  must  usually  resort  to  numerical 
techniques.  It  is  desirable  to  supplement  the  numerical  solutions 
with  approximate,  analytic  solutions  wherever  possible  since  they 
permit  simpler  calculations  for  the  range  in  which  they  are  valid  and 
generally  lead  to  greater  physical  insight.  In  this  paper,  we  will 
describe  a perturbation  expansion  we  have  developed  for  solving  the 
steady-state  Navier-Stokes  equations  describing  the  propagation  of  a 
planar  shock  wave  in  an  ideal  gas.  The  approximate  solution  we  obtain 
from  the  expansion  offers  a simple,  analytic  technique  for  evaluating 
the  effect  of  temperature-dependent  transport  coefficients  on  the 
structure  of  weak  shock  waves. 

Figure  1 shows  a typical  shock  profile  in  the  co-ordinate  frame 
in  which  the  shock  wave  is  stationary.  In  the  laboratory  frame,  the 
shock  actually  propagates  into  a stationary  gas  with  a speed  u^  in 

the  negative  x-direction.  As  it  propagates,  the  shock  wave  compresses 
and  heats  the  gas  and  sets  it  in  motion  so  that  the  density  and 


A more  detailed  treatment  of  this  presentation  can  be  found  in 
Reference  (1). 
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temperature  behind  the  front,  denoted  by  and  T^,  respectively,  are 

greater  than  the  corresponding  values,  and  T^,  ahead  of  the  front. 

In  the  stationary  frame,  the  flow  velocity  behind  the  front,  u^,  is 

less  than  that  ahead  of  the  shock  wave.  The  width  of  the  shock  front 
is  typically  several  molecular  mean  free  paths  and  decreases  as  the 
strength  of  the  shock  increases. 

. It.-  THE  FLOW  EQUATIONS.  The  shock  profile  in  the  stationary 
frame  is  determined  by  the  steady- state  hydrodynamic  equations  which 
express  the  conservation  of  mass,  momentum  and  energy: 


(1) 

(2) 


^ ^ + p ^ = 

dx  dx  XX  dx 


(3) 


In  these  equations,  e is  the  specific  internal  energy  of  the  gas,  Q 
is  the  energy  dissipated  due  to  heat  conduction  and  is  the  appropri- 
ate element  of  the  pressure  tensor. 

Equations  (1)  - (3)  are  general  and  exact,  but  they  contain 
insufficient  information  to  determine  the  flow  variables.  To  close 
the  set  of  equations,  it  is  necessary  to  specify  a form  for  the  thermal 
conduction  and  the  pressure  element,  as  well  as  an  equation  of  state. 

For  sufficiently  weak  shock  waves,  where  the  gradients  in  the  flow 
variables  are  small,  the  thermal  conduction  is  adequately  represented 
by  Fourier's  Law 


(4) 


where  k is  the  thermal  conductivity  and  the  pressure  term  by  Stokes' 
hypothesis 


P 

XX 


p - 4/3  P ^ , 


(5) 


where  P is  the  pressure  and  p is  the  viscosity.  The  conservation 
equations,  together  with  Fourier's  Law  and  Stokes'  hypothesis,  are 
generally  referred  to  as  the  Navier-Stokes  equations.  Finally,  we 
assume  for  this  analysis  a perfect  gas  so  that  the  equation  of  state 


i 


133 


I 


can  be  written  as 


P = ^ P T C6) 

where  is  the  specific  heat  at  constant  pressure  and  y is  the  ratio 

of  that  specific  heat  to  the  one  at  constant  volume.  The  values  of 
Cp  and  Y are  assumed  to  remain  constant  during  the  shock  compression. 


It  should  be  emphasized  that  the  Navier-Stokes  equations  are 
valid  only  for  relatively  weak  shock  waves.  For  strong  shocks,  the 
gradients  of  the  flow  variables  become  large  and  this  fact  precludes 
the  use  of  Fourier’s  Law  and  Stokes'  hypotehsis.  One  must  then  resort 
to  more  complicated  analyses  based  on  the  Boltzmann  equation. 


It  is  convenient  to  define  a shock  strength  parameter,  e,  according 
to 


e 


M^-1 

yM^+1 


(7) 


where  M is  the  Mach  number  given  by  the  ratio  of  the  shock  speed  u^  to 

the  sound  speed  in  the  gas,  c.  Tne  Mach  number  approaches  unity  for 
weak  shock  waves  and  increases  without  bound  with  increasing  shock 
strength.  Consequently,  e approaches  zero  for  weak  shock  waves  and 
has  a maximum  value  of  1/y  which  is  less  than  unity. 

We  choose  to  nondimens ionalize  the  flow  velocity  and  the  tempera- 
ture according  to 


n = u/u^ 

(8) 

t = (1-YE^)  T/T^ 

(9) 

where  u^  is  the  average  velocity  across  the  shock 

wave  and  T is  the 

0 

average  temperature.  These  average  quantities  can  be  Jetermired 
directly  from  the  Rankine-Hugoniot  conditions  (2)  which  relate  the 
final  values  of  the  flow  variables  behind  the  front  to  the  initial 
values  ahead  of  the  shock.  In  terms  of  the  shock  strength  parameter, 
the  averages  are  given  by 

T.+T.  , 2 

~ if  1-YE  T 

0 2 {l-Ye)(l+e)  i 

(10) 

U.+U-  u. 

J.  J.  1. 

% " 2 " 1+e  • 

(11) 
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The  transport  coefficients,  p and  k,  are  allowed  to  vary  with 
temperature,  but  we  take  advantage  of  the  fact  that  their  functional 
dependence  is  similar  in  the  temperature  range  of  interest.  Consequently, 
the  thermal  conductivity  and  viscosity  are  represented  by 


K = KgfCt) 
y = P^fCt) 

where  ic  and  p are  constants  and  f(l)  = 1. 

0 0 

Finally,  we  define  a nondimensional  co-ordinate 


(12) 

(13) 


(14) 


For  weak  shock  waves,  the  length  scale  p^/p^u^  is  on  the  order  of  a 
molecular  mean  free  path. 

Since  Equation  (1)  representing  conservation  of  mass  can  be  inte- 
grated directly,  the  Navier-Stokes  equations  can  be  reduced  to  a pair 
of  coupled,  nonlinear  differential  equations.  The  two  remaining 
equations  can  be  written  in  terms  of  the  nondimensional  variables  as 


jYf(t)n  Yn(i-n)  + n - t = o (15) 

1 dt  4 dn  ^ (1+y)  r f 1^2  2 1 - ,,,, 

Pr  dy  ■ 3 dy  2f(t)  [ ^ J ' ^ 

In  Equations  (15)  and  (16),  f(t)  represents  the  temperature  dependence 
of  the  transport  coefficients  and 


is  the  Prandtl  number  which  is  of  order  unity  for  gases.  For  this 
analysis,  Pr  is  a constant  since  c is  assumed  to  be  a constant  and 

p and  K have  the  same  dependence  on  temperature. 

Equations  (15)  and  (16)  are  to  be  solved  subject  to  the  boundary 
conditions  specified  by  the  Rankine-Hugoni ot  relations,  namely 


n(-“)  = l+e  ; ri(“>)  = 1-e  (18) 

t(-oo)  = (l-Ye)(l+e)  ; t(»)  = (l+Ye)(l-e).  (19) 


III.  PERTURBATION  EXPANSION  AND  FORMAL  SOLUTION.  We  now  proceed 
to  solve  Equations  (15)  and  (16)  by  a perturbation  expansion.  In  deter- 
mining the  form  of  the  expansion,  we  make  use  of  two  observations.  First, 
the  boundary  conditions.  Equations  (18)  and  (19),  indicate  that  the 
variation  of  n and  t across  the  shock  is  of  order  e.  Since  e approaches 
zero  as  the  shock  strength  decreases,  it  emerges  as  a natural  choice 
for  the  perturbation  parameter.  Second,  both  theory  and  experiment 
indicate  that  the  shock  thickness  varies  as  1/(M-1)  for  weak  shocks, 
which  in  turn  varies  as  1/e.  Therefore,  we  expect  that  dt/dy  and  dn/dy 
will  be  proportional  to  e At  and  e An,  respectively.  Actually,  this 
judgement  does  not  have  to  be  made  a priori  but  emerges  during  the 
perturbation  expansion  procedure  as  the  only  consistent  ordering  of 
the  derivatives. 


These  two  observations  suggest  expansions  for  n and  t of  the  form 


E 

j 

't>j(ey) 

(20) 

E e^ 

T 

0j (ey) 

(21) 

where  ‘('q  “ ®o  ” series  given  by  Equations  (20)  and  (21) 

correspond  to  expanding  the  velocity  about  its  midvalue  and  the  tempera- 

2 

ture  about  the  value  T (l-ye  ) which  approaches  the  midvalue  as  the 
shock  strength  diminisRes. 


In  order  to  determine  the  functions  i|)j  and  0j  , the  expansion  is 


substituted  into  Equations  (15)  and  (16)  and  the  resulting  equations 
solved  separately  to  each  order  in  e.  For  the  sake  of  brevity  the 
details  of  the  expansion  procedure  will  be  omitted.  A more  detailed 
treatment  of  the  perturbation  expansion  and  resulting  solution  can 
be  found  in  Reference  (1). 


Selecting  the  lowest  order  terms  in  the  two  equations  results  in  an 
ordinary  differential  equation  for  which  can  be  solved  to  yield 

()ij  = - tanh  (Xr)  (22) 


where 

r = ey 


All  sums  extend  from  0 to  “ unless  otherwise  noted. 
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(23) 


I 


and  a relation  between  0^^  and 


= (1-Y)<t>i- 

The  constant  X in  Equation  (22)  is  given  by 

^ 3(Y+l)Pr 
8Pr+6(Y-l)  ■ 


(24) 


Furthermore,  the  constant  of  integration  has  been  chosen  so  that  y=0 
represents  the  location  of  the  midvalue  of  the  velocity  profile. 

For  n > 2,  solving  the  system  of  equations  for  order  e"  results 
in  a differential  equation  for  which  can  be  written  as 


d(j! 

- 2 (P  = T , 
dr  ^1  ^n  n-1 


(25) 


where  is  a rather  complicated  function  of  the  n-1  preceding 

and  6.,  Since  T , is  not  a function  of  di  or  6 , Equation  (25)  is  a 
linear,  ordinary  differential  equation  which  has  the  formal  solution 

= sech^  (Xr)  Jcosh^  J ' 

Again,  9^  can  be  related  algebraically  to  and  the  preceding  (Ji^  and  9^. 

Consequently,  it  is  possible  to  obtain,  at  least  formally,  as  many 
terms  in  the  expansion  as  one  wishes  by  repeatedly  evaluating  the 
integral  in  Equation  (26).  In  practice,  this  soon  becomes  difficult  to 
do  analytically  since  ^ becomes  progressively  more  complicated  as  n 
increases.  However,  we  were  able  to  evaluate  the  integral  explicitly 
to  obtain  the  second-order  term,  that  is  the  term  in  the  solution  of 
2 

order  e . The  resulting  second-order  approximations  to  the  velocity 
and  temperature  profiles  are  given  by 


f 2 2 

n-  “ ^ u = 1-e  tanh  (Xr)  + e A sech  (Xr)5,n  sech  (Xr)  (27) 
^ o 

= 1 + e(Y-l)  tanh  (Xr) 

, (28) 
2 j 2 

+ E ^ - A(y-1)  sech  (Xr)  Hn  sech  (Xr) 

+ y[  (1-4X/3)  sech^(Xr)  - 1 ] | 
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where 


3(y^-1) (4Pr-3) 
[4Pr+3(Y-13]^ 


(29) 


i 


IV.  DISCUSSION.  It  is  apparent  that  as  r ->■  ± «>  the  second-order 
solutions,  given  by  Equations  (27)  and  (28),  are  in  identical  agreement 
with  the  Rankine-Hugoniot  relations,  expressed  in  the  form  of  Equations 
(18)  and  (19),  for  all  shock  strengths.  Of  course,  agreement  at  the 
endpoints  does  not  guarantee  that  the  approximate  solutions  adequately 
represent  the  entire  profile.  In  an  attempt  to  estimate  the  range  of 
validity  of  the  approximate  solutions,  we  compared  them  with  a special- 
case  calculation  which  is  known  to  have  an  analytic  solution. 

As  first  shown  by  Becker  (3),  the  steady-state  Navier-Stokes 
equations  can  be  solved  exactly  for  the  case  where  p and  k are  temper- 
ature independent  and  Pr  = 3/4.  We  have  compared  our  approximate 
solutions  with  the  exact  solution  for  that  case  with  y equal  to  5/3. 

In  Figure  (2),  we  show  the  first-  and  second-order  approximations  to  the 
velocity  profile  for  a Mach  number  equal  to  2.  This  value  represents 
approximately  the  maximum  Mach  number  for  which  the  Navier-Stokes  equations 
themselves  are  valid.  For  this  case,  the  exact  solution  is  indistinguish- 
able from  the  second-order  approximation,  the  difference  being  less  than 
1%  throughout  the  entire  range  of  r.  Although  the  results  are  not 
reproduced  here,  the  second-order  approximation  to  the  temperature  ratio 
was  also  found  to  be  within  1%  of  the  exact  solution  at  M=2. 

The  agreement  was  even  better  for  smaller  values  of  M,  as  would  be 
expected.  As  M increased,  the  deviation  between  the  exact  and  second- 
order  profiles  increased;  but  even  for  a Mach  number  of  10,  the  difference 
was  less  than  15%.  Of  course,  the  results  at  large  Mach  numbers  are  only 
of  academic  interest  since  the  original  equations  are  then  no  longer  valid. 

These  results  are  somewhat  surprising  since  other  expansions  using 
M-1  as  a perturbation  parameter  are  valid  over  a much  smaller  range.  It 
appears  that  in  this  case  the  choice  of  an  expansion  parameter  significantly 
affects  the  accuracy  of  the  approximation. 

There  are,  in  fact,  two  indications  that  suggest  that  e is  a more 
appropriate  expansion  parameter  than  M-1.  First,  e never  exceeds  unity 
as  the  Mach  number  increases,  whereas  M-1  increases  without  bound  as 
M Second,  with  our  nondimensionalization  we  were  able  to  satisfy 

the  boundary  conditions  exactly  for  all  Mach  numbers  with,  at  most,  a 
second-order  approximation,  whereas  all  the  terms  in  the  series  are 
required  if  M-1  is  used  as  an  expansion  parameter. 
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Figure  2. 
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Comparison  of  first-  and  second-order  solutions  for  temperature- 
independent  transport  coefficients.  Becker's  solution  for  n 
is  not  shown  since  it  was  found  to  be  coincident  with  ri2-  The 

results  are  plotted  for  the  following  values  of  the  parameters: 
M=2,  Pr=3/4,  y=5/3. 
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The  .second-order  approximation  can  be  used  to  examine  the  effect 
of  temperature -dependent  transport  coefficients  on  the  shock  profile 
since  the  terms  of  order  e in  Equations  (27)  and  (28)  depend  on  df/dt 
at  t=l.  For  example,  simple  kinetic  theory  (4)  predicts  that  for  a 
hard-sphere  gas  the  Prandtl  number  is  equal  to  2/3  while  the  viscosity 
and  thermal  conductivity  are  given  by 

u = Uq  t (30) 

ic  = t ^ . (31) 

In  Figure  3,  we  have  plotted  r\2  calculated  from  Equation  (27)  for 

the  case  in  which  M=2,  Pr=2/3,  y=5/3,  and  u and  k are  given  by  Equations 
(30)  and  (31).  The  profile  is  compared  with  the  solution  for  obtained 

by  holding  y and  k constant  at  their  upstream  values.  Both  profiles  are 
plotted  as  a function  of  the  dimensionless  parameter  z = Xep^  u^  x/y^. 

It  is  apparent  from  the  figure  that  including  the  temperature-dependence 
of  the  transport  coefficients  changes  the  value  of  r]^  by  as  much  as  12% 

at  this  Mach  number.  The  broader  front  in  the  temperature-dependent  case 
is  to  be  expected  since  the  dissipation  is  enhanced  by  allowing  y and  k 
to  increase  with  temperature. 


V . SUMMARY . We  have  developed  a perturbation  expansion  for  the 
Navier-Stokes  equations  describing  steady-state,  one-dimensional  shock 
propagation  in  an  ideal  gas.  Formally,  the  expansion  can  be  evaluated 
to  any  order  by  quadrature.  The  second-order  solution  has  been  determined 
explicitly  and  this  solution  appears  to  accurately  represent  the  shock 
profile  for  the  range  of  Mach  numbers  in  which  the  Navier-Stokes  equations 
themselves  are  valid.  The  expansion  provides  a simple,  analytic  method 
for  investigating  the  effect  of  temperature- dependent  transport  coeffi- 
cients on  the  shock  profile. 
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A GENERALIZED  COMPARISON  PRINCIPLE  AND  MONOTONE  METHOD  FOR 


SECOND  ORDER  BOUNDARY  VALUE  PROBLEMS  IN  BANACH  SPACES* 

S.  R.  Bernfeld^ , V.  Lakshraikantham^ , S.  Leela^’^ 

kbitAOLcX 

A generalized  comparison  principle  for  second  order  differen- 
tial inequalities  is  established  in  a Banach  space  where  the  in- 
equalities are  given  relative  to  an  arbitrary  cone.  In  case  the 
< 

Banach  space  is  the  real  line  the  comparison  principle  reduces  to 
the  classical  maximum  principle. 

The  comparison  principle  is  then  used  to  develop  a monotone 
method  to  generate  two-sided  bounds  on  solutions  of  nonlinear  boun- 
dary value  problems  for  ordinary  differential  equations  in  a Ba- 
nach space. 

I.  IntAoduiCtion 

Monotone  methods  have  been  used  to  generate  maximal  and  mini- 
mal solutions  of  nonlinear  boundary  value  problems  for  both  ordin- 
ary and  partial  differential  equations.  This  study  was  originally 
motivated  by  the  problem  of  extending  the  chord  method  as  used  by 
Keller  [8]  and  Sattinger  [11],  who  considered  nonlinear  partial 
differential  equations  containing  no  gradient  terra.  The  inclusion 
of  a gradient  term  was  first  introduced  by  Chandra  and  Davis  [5]  who 
considered  the  ordinary  boundary  value  problem 

u"  = f(t,u,u')  (1.1) 

B^u  = a..u(i)  + = b.,  i = 0,1.  (1.2) 

1 IX. 

Here  f e CllO,l]  x R x R,  /?],  aQ,oij  ^ 0,  Sj  > 0. 

They  assumed  that  f depended  linearly  on  u' . This  restriction 
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on  u’  was  eliminated  by  Bernfeld  and  Chandra  [2].  The  approach 
that  is  used  in  [5]  and  [2]  is  to  first  ascertain  the  existence  of 
a lower  solution  and  upper  solution  such  that 

tiQ(t)  ^V^d).  By  assuming  a Nagurao  Condition  [3]  on  f(t,u,u') 
one  is  able  by  standard  methods  to  obtain  a uniform  estimate  on 
the  derivative  of  any  solution  z(t)  of  (1.1)  such  that 
^z(t)  ^V^(t)  t e 10,2],  These  conditions  imply  the  existence 
of  solutions  of  (1.1)  and  (1.2).  In  order  to  obtain  the  monotone 
iterations,  one  linearizes  f(t,u,u')  in  the  variable  u about 
any  element  z(t)  such  that  ^z(t)  ^v^dj.  More  precise- 

ly consider 

u"  = Fd,u,u'  ,z) , (1.3) 

where  Fd,u,u' ,z)  = fdjZ,u')  + tiu  - Hz,  where  £ z(t) 

^ V^d)  and  b is  an  upper  bound  on  \f^d,u,u')\  for  t e 
10,2],  u^d)  ^ ud)  ^V^d)  and  |k'|  where  N is  obtained 

from  the  Nagumo  Condition.  The  existence  of  solutions  of  (1.3) 
and  (1.2)  follows  as  in  the  problem  (1.1), (1.2).  By  using  the  clas- 
sical maximum  principle  one  obtains  for  each  zd)  such  that 
u^d)  ^ zd)  ^V^d)  a unique  solution  w(t)  of  (1.3),  (1.2) 
such  that  ^wd)  ^V^d).  If  we  define  the  mapping  k to 

be  governed  by  the  rule  W = Az,  then  by  the  maximum  principle 
one  has  that  Au^  > u^,  Av^  £ and  A is  monotone  on  < u^, 

Vq  >,  that  is,  Zj  £ ^2  £ ^^2'  Finally  by  defining  the 

sequences  u = Au  ,,  V = Av  , one  shows  that  lu  } and  {v  } 

^ n n-2  n n-2  '■  n‘  ^ n‘ 

converge  uniformly  and  monotonically  to  the  minimal  and  maximal  so- 
lutions respectively  of  the  BVP(l.l),  (1.2). 

The  extension  of  these  results  to  and  infinite  dimen- 

sional systems  of  the  type  (1.1)  is  important,  for  parabolic  equa- 
tions of  the  form 

^xx  " 

can  be  approximated,  using  the  method  of  lines,  by  n-dimensional 
and  infinite  dimensional  systems  of  type  (1.1)  (see  Llskovets  [9] 
and  Thompson  [13]).  Char.dra,  Lakshmikantham,  and  Leela  [6]  exten- 
ded the  development  of  the  monotone  method  to  the  Banach  space  E 
of  bounded  sequences  i = ^ particular  they  con- 

sidered the  BVP(l.l),  (1.2)  in  which  f e C ]_[0,2]  x E x E,  £], 
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One  of  the  biggest  problems  in  extending  the  scalar  case  is 
the  lack  of  a suitable  maximum  principle  in  higher  dimensions.  In 
[6]  a comparison  principle  in  E was  developed  which  played  the 
same  role  as  the  classical  maximum  principle  for  the  scalar  prob- 
lem. All  Inequalities  are  assumed  to  be  componentwise.  An  appro- 
priate existence  theorem  in  Banach  spaces  is  used  in  which  the  mod- 
ified function  approach  is  the  principle  tool.  In  particular  ap- 
propriate compactness  type  conditions  are  imposed  on  f in  terms 
of  the  measure  of  noncompactness  (see  J.  Chandra,  V.  Lakshmlkantham, 
A.  R.  Mitchell  [7]).  The  comparison  result  allows  us  to  assert 
that  the  solution  of  (1.1),  (1.2)  lies  between  and 

the  lower  and  upper  solutions  respectively.  (See  also  R.  Thompson 
[14]  for  a good  discussion  of  existence  in  the  space  E.)  The  com- 
parison principle  thus  becomes  the  main  tool  in  the  development  of 
the  monotone  method.  Conditions  such  as  quasimonotonicity  on  f 
and  the  restriction  that  f-(t,u,u')  = f.(t,u,u')  are  necessary 

"V  ^ "If 

in  order  to  successfully  develop  the  monotone  method.  Recall,  for 
example,  that  in  iP' , n > 1,  the  only  known  methods  showing  that 
£ V^d),  where  3nd  ^re  lower  and  upper  so- 

lutions in  the  usual  sense 

I'o" 

are  to  require  that  f .d,x,x' ) i f .(t,x,x . One  can  require 

^ If  % 

more  conditions  on  and  V^,  thus  eliminating  the  restric- 

tions on  f,  but  this  seems  to  be  incompatible  with  the  develop- 
ment of  the  monotone  method. 

As  in  scalar  case  the  comparison  principle  yields  the  unique- 
ness of  the  solutions  of  system  (1.3)  and  leads  to  the  subsequent 
development  of  the  monotone  operator  A (see  Section  IV). 

In  this  note  we  shall  extend  the  comparison  theorem  to  arbi- 
trary Banach  spaces  E.  We  shall  deal  with  the  second  order  dif- 
ferential inequalities  in  which  tlie  inequality  relation  is  induced 
by  a cone  K in  E.  We  shall  discuss  the  natural  development  of 
the  monotone  method  but  not  present  the  details  here.  Our  results 
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will  include  those  in  [5],  [2],  [6]  and  complement  the  results  of 
R.  Thompson  [15].  We  will  be  more  precise  about  this  later. 

2.  PKeJUmina/U.tl> 

Let  B he  a Banach  space  with  norm  | • | and  let  B*  denote 
the  set  of  continuous  linear  functionals.  Let  be  a cone  in  B 
which  induces  a partial  ordering  ^ as  follows:  x if  and  only 

if  y - X € X. 

A linear  functional  e B*  is  called  a positive  linear  func- 
tional if  ^(x)  ^ 0 whenever  x e K.  Let  K*  denote  the  set  of 
positive  linear  functionals.  Notice  then  that  K is  contained  in 

C 

the  closed  halfspace  = {x  e B 1 ^ 0,  ^ a positive  linear 

functional}.  Thus  the  positive  linear  functionals  are  support 
functionals  and  since  ^ is  a cone  in  B,  then  K is  the  intersec- 
tion of  all  the  closed  half-spaces  which  support  it.  If  C K* 
and  X = I"*  ) (J)  6 then  we  say  X^  generates  X.  Let  X^ 

= {<()  e X-  I II  $ II  = j},  and  X the  closure  of  X in  the  weak 
star  topology. 

Denote  by  int  X,  the  interior  of  X and  if  int  K / 0,  then 
X is  called  a solid  cone.  It  is  interesting  to  note  that  x 6 int 
X if  and  only  if  there  exists  C > 0 such  that  <^(x)  ^ C for  all 

^ u 

3.  CompaAyUon  PeMitt 

As  indicated  in  the  introduction  an  essential  feature  of  the 
monotone  method  for  (1)  (2)  is  the  application  of  a comparison 
principle,  which  in  one  dimension,  is  the  classical  maximum  prin- 
ciple. We  will  state  the  comparison  result  and  indicate  its  impli- 
cations. 

We  consider  the  boundary  value  (1.1),  (1.2),  where  f : 

I X B X B B is  continuous  with  I = [O,!].  Let  X be  a cone 
in  B generated  by  X^. 

Definition;  We  say  f(t,x,x')  is  quasimonotone 
nonincreasing  in  x with  respect  to  X if  x ^ y, 

(|)('x)  = (i)(y),  ((iCx'J  = 'ti(y’)  implies  (t>{f(t,x,x')) 

1 for  all  (p  e x^. 
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For  the  case  in  which  B - R , 1 ^ n ^ where  K is  the 
cone  generated  by  the  positive  projections,  this  condition  on  / 
implies  that  = f^(tyX,x^').  This  can  be  seen  by  let- 
ting X = y,  and  thus  obtaining  f^(t,x,x')  = f ^(t,x,y ' } when- 

ever X.'  = y.'.  If  for  example  f(t,x,x')  is  linear  in  x and 
x'  then  f(t,x,x')  = P(t)x'  + Q(t)x,  where  P(t)  is  a diagonal 
matrix  and  Q(t)  is  a matrix  in  wViich  q^j(t)  ^ 0 i ^ j. 

We  consider  a family  of  functions  \ > 0,  t e [0,1]^ 

and  we  shall  write  Z(X,t)  B Z^(t) . We  say  this  family  is  admis- 
sible if  Z(\^')  is  e for  each  X and  Z(‘,t)  is  con- 
tinuous in  X for  each  t and  Z(0,t)  = 0 for  0 ^ 1.  The 

family  is  said  to  satisfy  a uniformity  condition  at  X = 0 (X  = °o) 
if  for  each  (})  e -*  0 as  X -*■  0 uniformly  for  t e 

{o,n  ((i>(z/t;)  =0  as  X “ uniformly  for  t 6 [0,1]].  An  ex- 

ample of  such  a family  is  {X  Z(t)],  X > 0,  Z(t)  continuous, 
<i)[Z(t)]  > 0 for  each  t e [0,1]  and  for  each  ((>  6 K^, 

We  are  now  ready  to  state  our  main  result  of  this  section. 

ThfcCiAgJii  3.t  Lev.  B be  a Banach  space  and  K a cone  in  B,  As- 

sume f(t,x,x')  is  continuous  and  quasimonotone  nonincreasing  in 
X with  respect  to  K.  Suppose  V and  W are  lower  and  upper  so- 
lutions of  (1.1)  and  (1.2)  respectively;  that  is,  V,l</  6 C^[I,B] 
and  for  ttl,i=  0,1 

V"(t)  >f[t,V(t),V'(t)],  b\v  ±b^ 

W"(t)  <f[t,W(t),W'(t)),  B^W^b.. 

Let  be  an  admissible  family  satisfying  a uniformity  con- 

dition at  X = 0 and  X = °°  such  that  for  each  t e [0,1]  and 
for  each  d)  6 

u 

^[z^"(t))  < (]>[f{t,W(t)  + Z^(t),W'(t)  + z^dt)]]  ^3  2) 

- ,w'(t)]] 

and  B^Z^(0)  > 0 B^Z^(l)  > 0.  Then  V(t)  < W(t)  on  J. 

This  theorem  includes  a result  of  Schroder  [12,  references 
therein]  who  considered  the  case  in  whicli  B = R.  Moreover  our 
result  includes  tl\e  comparison  theorem  in  [6]  which  was  concerned 


with  the  case  B = R 1 < n < 


In  that  case  K is  the  set  of 


vectors  all  of  whose  components  are  greater  than  or  equal  to  zero 
and  are  the  positive  projections. 
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We  now  apply  Theorem  3.1  to  the  case  in  which  f is  linear, 
that  is,  f(t^x,x')  = Q(t)x  + P(t)x',  where  P(t)  and  Q(t)  map 
B B linearly.  Then  the  quasimonotonicity  of  f with  respect 
to  K implies  that  x £ J/,  i>(x)  = ^(y)  then  ^[OJDx]  <\i[Q(t)y] . 
Thus  Q(t)x  is  quasimonotone  nonincreasing  in  x with  respect  to 
K.  Moreover  we  find  that  (J) (Pft Jx ')  = (p[pft)y']  when  <i)(x')  = 
^(y’)  (in  case  B = 1 , this  implies  that  P(t)  is  a 

diagonal  matrix,  where  K is  the  cone  of  vectors  whose  components 
are  nonnegative) . We  thus  arrive  at  the  following  corollary  assum- 
ing all  the  hypotheses  of  Theorem  3.1  in  the  special  case  that  f 
is  linear. 

i 

CoKoZJiaAij  3.  ? Suppose  that 

(i)  y e C'^[I,B]  and  for  t % I 

W"  <P(t)W'  + Q(t)W,  B^W^O,  i = 0,1, 
where  Q(t)  is  a quasimonotone  nonincreasing  linear  mapping  with 
respect  to  K and  P(t)  has  the  property  that  i^[P(t)x’]  = 
^{P(t)y'']  whenever  ^(x')=  '^(y’)  for  each  <()  e 

(ii)  let  {Z^(t)}  be  an  admissible  family  satisfying  a uni- 
formity condition  at  X = 0,  X = co  such  that  for  each  t e [0,1] 
and  for  each  6 € X 

u 

^[(Z^")(t))  < if[Q(t)Z^(t))  + i?[P(t)Z^'(t)]  and  B°{z^(0)]  > 0, 

b\z^(i))  '>  0. 

Then  y(t)  £ 0, 

The  proof  of  this  corollary  follows  from  Theorem  3.1  by  let- 
ting V 0 . In  the  case  B = R this  corollary  corresponds  to 
the  generalized  maximum  principle  [10,  p.  8]. 

We  observe  that  the  generalized  comparison  theorem  has  much 
flexibility  because  for  a given  f(t,x,x')  one  may  have  many  cones 
in  which  / is  quasimonotone.  Moreover  we  do  not  require  the 
cones  have  an  interior,  as  is  often  assumed,  for  the  strict  in- 
equalities used  here  are  in  terms  of  linear  functionals.  Thus  al- 
though X > 0 usually  implies  that  x is  in  the  Interior  of  a 
cone,  we  only  require  <^(x)  > 0 for  each  <j)  6 K^.  This  may  occur 
in  cones  with  no  interior  such  as  in  TP . 
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4.  Monotone.  Method 


( 


We  shall  briefly  discuss  the  monotone  method  here. 

The  first  step  is  to  obtain  existence  of  solutions  of  (1.1) 
and  (1.2).  We  do  this  by  using  the  modified  function  approach  [3] 
combined  with  a Nagumo  condition,  assuming  the  existence  of  lower 
and  upper  solutions  V(t),  Vit)  respectively  with  V(t)  ^W(t) . 
The  system  (1.3)  is  developed  as  follows:  assume,  as  before, 
f(tiXjy)  is  quasimonotone  in  x with  respect  to  K.  Let 
f(tiX,y)  be  continuously  differentiable  in  x and  y,  and  for 
t Z I,  V(t)  X ^y(t)  and  |y|  y (obtained  using  Nagumo  con- 
dition, see  [3]),  the  Frechet  derivative  of  f with  respect  to  X, 
satisfies  for  each  z e B 

D^(t,x,y)[z]  ^ Q(z)  , (4.1) 

where  Q(z)  is  quasimonotone  nonincreasing  in  z with  respect  to 
K.  We  then  define  for  any  element  n(t)  such  that  V(t)  ^n(t) 

< W(t) 


F(t,Xjx')  = F(tyX,x'  ,n) 

= F[t,n(t) ,x']  + q{x  - n(t))  . 

Using  the  above  arguments  it  can  be  shown  that  the  boundary 
value  problem  (1.2)  and 

x"  = F(t,XyX' ) (4.2) 

has  a unique  solution  x(t)  such  that  V(b)  ^x(t)  ^W(t)  for 
each  n(t)  such  that  V(t)  ^n(t)  ^W(t). 

For  each  n e C(I,B)  such  that  V(t)  ^n(t)  ^W(t)  on  I, 
define  the  mapping  A by 
An=  X 

where  x is  the  unique  solution  of  the  boundary  value  problem 

(4.2),  (1.2).  Using  Theorem  3.1  one  can  establish  that  A is  a 

monotone  operator  on  the  segment  <V,W>=  (w  € E : V(t)  w £ 

W(t),  t 6 j}.  If  we  define  the  sequences 

V = AV  y = AW  (4.3) 

n n~l  n n-1' 

where  = V and  W^  = W we  can  state  our  main  result  of  this 
section : 


Thc.OA.en]  4 , I Assume  f : I x B k B -*■  B is  completely  continuous 
and  that  the  hypotheses  of  Theorem  3.1  Is  satisfied.  Assume  the 
operator  Q defined  in  (4.1)  is  completely  continuous  and  that 
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F(t,x,  x' ) satisfies  a Nagumo  condition.  Then  the  sequences 

defined  by  (A. 3)  converge  uniformly  and  monotonlcally  to  the 

minimal  and  maximal  solutions  V . , W respectively  of  the 

mzn  max  ^ 

boundary  value  problem  (1.1),  (1.2)  on  <VfW>, 

If  there  exists  a unique  solution  of  (1.1),  (1.2)  then  the  se- 
quences converge  uniformly  to  the  solution,  and  we  thus 

have  a constructive  technique  to  obtain  solutions. 

One  may  weaken  the  assumption  of  complete  continuity  by  assum- 
ing conditions  on  f in  terms  of  the  measure  of  noncompactness  [7). 

One  application  of  our  work  arises  in  the  study  of  stochastic 
differential  equations.  For  example,  recent  work  in  [A]  suggests 
that  an  appropriate  model  for  the  birefringence  of  a solution  of 
proteins  polymerizing  under  an  electric  field  is  the  system  (1.1), 
(1.2).  Here  the  Banach  space  B would  be  the  space  of  distribu- 
tion functions  over  (0,'=°)  since  at  any  given  time  the  length  of 
the  polymers  form  a certain  type  of  distribution  which  varies  with 
time. 

Finally  there  has  been  other  work  concerned  with  the  problem 
with  which  we  have  addressed  ourselves.  In  particular  Amann  and 
Crandall  [1]  have  obtained  a monotone  method  for  nonlinear  semi- 
linear  elliptic  boundary  value  problems. 
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Coramunlcations/luloinatlc  DatK  Processing  Xabomtoi/ 

U.  S.  JLiiaj  Electronics  Cammand,  Port  Hsnmoutb,  Ksw  Jerse/  07703 


1.  IntrtxiuctloD.  We  consider  the  equation 

(1)  Lu  : u"  + pu'  - qu  - 0 

and  solutions  u - u(x)  vhlch  are  positive  on  an  Interval 

(2)  I - = 03. 

iuJoptlng  an  Idea  suggested  bj  Qarrett  Birkhoff,  we  use  Stunrdan- 
type  arguaients  to  determine  how  majorizing  p - p(x)  or  q " q(x) 
— i.  e.,  replacing  p or  q with  a larger  function  — affects  the 
ir-agnltude  of  the  positive  solution.  This  leads  to  some  inter- 
esting • corollaries;  e.  g.,  if  u is  a positive  solution  of 
(ru’)'-qu=0onl  .with  0 <r  rq  = c on  I and  u'  ^ cu/r  at  Xq, 

^ f'  JL 

then  u = UQexp(cJ^  dt/r(t)).  These  results  appear  to  be  new 
even  thougn  the  technique  is  classical. 

Throughout,  we  shall  let  u^  = c 

2.  Kajoilzing  the  coefficient  p.  Typical  is  our  first 
result,  in  which  we  coiiroare  the  damping  coefficients  p in  two 
equations  of  the  form  (1). 

Lemma  1.  Let  u be  a positive  solution  of  (l)  and  v = v(x) 
a positive  solution  of 

(3)  v"  + p^V  - qv  •=  0 
on  the  interval  I. 
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(i)  If  p < and  v'  i 0 on  I,  and  u'v-uv'  i 0 at  Xq,  then 
'^'^0  * ^^^0 

(ii)  If  p ^ Pj^  and  V 4 0 on  I,  and  u'v-uv'  < 0 at  Xq,  then 
u/Uq  ^ v/vq  on  I. 

(iii)  If  p i p^  and  v'  i 0 on  I,  and  u'v-uv'  ^ 0 at  x^,  then 
u/Uq  4 v/Vq  on  I. 

(iv)  If  p p^  and  v'  ^ 0 on  I,  and  u'v-uv'  ^ 0 at  Xq,  then 
u/Uq  i v/vq  on  I. 

Proof.  Multiply  (1)  by  v and  (3)  by  u and  subtract,  getting 
v'  + pv  “ (p^-p)uv',  where  w z u'v-uv'.  Thu3[exp(j^  p dt)w^'  * 
(Pl"P)®^(jx^  P dt)uv'  whence,  for  case  (i),  exp(f^  p dt)w  i 
w(xq)  = 0.  Thus  w 2 u'v-uv'  i 0.  Dividing  by  uv  > 0 and 
.integrating  complete  the  proof  of  the  first  case.  The  other 
cases  are  proved  similarly. 

Example  1.  Consider  the  functions  v = exp(tcxV2),  with 

0 < c = const.,  which  are  linearly  independent  solutions  of 
2 2 

v"  - v'/x  - c X V = 0.  Then  when  Lu  = 0 and  Xq  > 0, 

(i)  If  p S -1/x  and  u'(xq)  i cxqUq,  then  u ^ UQexp[c(x^-XQ2)./2]| . 

(ii)  If  p ^ -1/x  and  u ' (x^)  $ -cx^u^,  then  u < UgSxp  [;-c(x^-Xq^)/2J  . 

(iii)  Reverse  all  inequalities  in  (i). 

(iv)  Reverse  all  inequalities  in  (ii). 

Now  assume  0 < q € C^(l)  and  consider 

(1|)  v"  + (2aq^  - q'/2q)  v'  - qv  “ 0, 


with  solutions  |i 


(5)  V - e2p[(-ai/a2|-l)/jj* 

on  I.  Then  Lemma  1 immediately  yields  the  following  result. 


Theorem  1.  Let  u be  a positive  solution  of  Lu  = 0 on  I 
with  0 q t C^(I).  Than  with  a = const,  and  s -ai(a^+l)^, 
we  have 

(i)  If  p 4 i^aq'^  - q'/2q  on  I suid  u'  i b^qAi  at  Xq,  then 
u •%  UQe:q)[^b+ixp  q^  dt^  on  I. 

I } 

(ii)  If  p ^ 2aq2  - -i'/’-  i on  I and  u'  ^ b_q^u  at  Xq,  then 
u ^ u^axp[b_i^^  dtj  on  I. 

(ill)  Ravarsa  all  inaqualitLes  in(i). 

(iv)  Keversd  all  inequalities  in  (ii). 

The  special  cases  a = 0 (whence  b^  = ;;l)  and  p ; 0 are 


particularly  interesting.  Taking  the  extreme  case  that  both 
a = 0 and  p 5 0,  we  have  the 

Corollary.  Let  u be  a positive  solution  of  u'  - qu  = 0 
on  I,  with  0 <1  q t C^(I)  • Then 

(i)  If  q'  < 0 on  I and  u'  ^ q^  at  x,,  then  u >n^‘2xp{J‘  q%t)  on  I. 

- U - J Xq 

(ii)  If  q'  ^ 0 on  I and  u'f -q^i  at  Xq,  then  u iUQexp(-J^^qmt)  on  I. 

(iii)  Il;verse  ail  inequalities  in  (i). 

(iv)  lif'verse  all  inequalities  in  (ii). 

3.  Ka j oriaing  the  coefficient  Mow  we  compare  the 
restoring  coefficients  q in  two  equations  of  the  form  (1). 

Theorem  Let  \i  and  v be  positive  solutions  of  (1)  and 
('■■')  V + pv'  - q^v  -■  0, 


I 

! 


respectively,  on  I.  If  q 5 on  I and  u'v-uv'  ^ 0 at  Xq,  then 
u/uq  ^ v/vq  on  I. 

Proof.  Multiply  (l)  by  v and  (6)  by  u and  subtract, 
getting  w'  + pw  = (q-q-j^)uv,  where  w = u'v-uv'.  Then 


^ = (q-qj^)uv  P dt). 


'.ie  conplete  the  proof  by  integrating  and  considering  the  signs 
of  u,  V,  q-q^  and  w(xq) . 

The  conclusion  of  Theorem  2 is  clearly  valid,  verbatim, 
when  u and  v are  positive  solutions  of  the  self-adjoint  equations 
(ru')'  - qu  = 0 and  (rv')'  - q^v  = 0 on  I,  with  r > 0.  Now 
consider  in  particular  the  self-adjoint  equation 
(7)  (rv')'  - (c'^/r)v  = 0,  r = r(x)  >0,  c = const. 0, 


wita  a solution 

(3)  V = u^exj-^CcX^  dt/r(t)). 

Then  conclude 


Ccroll '.ry  . Let  u ■>:;  a positive  solution 'of  (ru')'  - qu  * 0 
on  I,  with  r > 0 and  ^q  > 0.  (i)  If  rq  ^ c'"  on  I and  u'  4 cu/r 

at  Xq,  then  u <UQexp{of^^  dt/r(t)).  (ii)  Reverse  all  inequal- 
itiss  in  (i). 

Another  application  of  Theorem  2 is 
Corollary  2.  Let  u be  a positive  solution  on  I of 
u"  + 2fu'  - qu  = 0,  f ^ C^(I).  Then  for  yiy  c = const.,  we  have 
(i)  If  q 1 -f^-f '+c^  on  Itand  u'  < (c-f)u  at  x^,  then 

f 


j 


1 
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V"  + 2fv'  + (f^+f'-c^)v  - 0, 
f ^ 

witn  a solution  v = e:qp(cx  - J f{t)dt). 

loiample  2.  If  we  now  consider  the  exact  equation 
v"+pv'+p'v=0  and  apply  Tneorem  2,  we  find  that  if  u is  a 
positive  solution  of  Lu  = 0 and  if  q ^ -p',  then 

u 4 P P d3)dt], 

where  a = (u’-*Tpu)|  . 

Similarly,  by  considering  the  equation  v"  pv'  =0,  we 
obtain 

Corollary  3.  Let  u be  a positive  solution  of  Lu  = 0,  and 
suppose  that  'an.  + u'(x_  )X:-  sxp(- f ^ p ds)dt  > 0 on  I.  Then 

(i)  If  q > 0,  then  u ^ u + u'vx^) /*exp(-  f^p  ds)dtj  and 

- u •'Xq 

(ii)  Reverse  all  inequalities  in  (i) . 

The  differential _inequality_Lv  ^0.  Similar  techniques 
perrdt  us  to  derive  a result  reminiscent  of  Caply gin's ( [l,  p.  13?], 
[3>  P*  15] ),  which  requires  that  q ^ 0.  The  following  theorem 
eliminates  that  condition  at  the  expense  of  requiring  that  u > 0 
and  V > 0.  7nile  this  r-r.sult  is  kncim  [2,  pp.  27-28],  we  include 
it  hare  because  it  is  an  almost  trivial  ccroUary  of  Thacrem  2. 
fheprom  3-  Let  v be  a given  positive  function  ^ C‘'(I)  and 
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u be  arv  positive  solution  ol'  (1)  on  I.  Tr.en  (i)  It  Lv  ^ 0 on  I 
and  u'v  - uv'  ^ 0 at  x^,  then  u/Uq  ^ '''/Vq  on  I;  and  (ii)  heverse 
all  inequalities  in  (i). 

Proof.  Tfte  given  function  v satisfies  (6)  with  5 (v''+pv')A* 
Since  qp-q  “ L(v)/v,  the  conclusion  is  an  immediate  consequence 
of  Tneorem  2 , 

Kow  by  considering  convenient  comparison  functions  v and 
applying  Theorem  3>  we  obtain  the  following  result,  which  is 
independent  of  the  dairping  coefficient  p. 

Corollary  1.  Suppose  Lu  = 0 on  I.  (i)  Let  q > 0;  if 
Uq  > C and  u'(xq)  > 0,  then  u ^ u^  on  I.  (ii)  Let  q S 0;  if 
u > 0 on  I and  u ' (xq)  < 0 , then  \i  4 Uq  on  I . 

Proof.  Consider  the  constant  function  v = Uq  and  apply 
Theorem  3 Por  the  cases  q = 0 and  q = 0. 

oimilarly,  by  considering  the  function  v = e *,  we  obtain 

Corollary  2.  If  u is  a positive  solution  of  (l)  on  I and 
p^+liq 0,  then  u 4 u^exp  [(u '/u)  ( (x-x  )J  on  I.  On  the  other  hand, 
the  last  inequality  is  reversed  if  c^+pc-q  ^ 0 on  I,  where  c s u'(xq)/Uq. 


1 
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THE  COLLAPSED  CUBIC  ISOPARAMETRIC  ELEMENT 
AS  A SINGULAR  ELEMENT  FOR  CRACK  PROBLEMS 


S.  L.  Pu,  M.  A.  Hussain  and  W.  E.  Lorensen 
Benet  Weapons  Laboratory 
Watervliet  Arsenal 
Watervliet,  New  York  12189 


ABSTRACT . For  the  12-node  quadrilateral  isoparametric  elements,  it 
is  shown  that  the  inverse  square  root  singularity  of  the  strain  field  at 
the  crack  tip  can  be  obtained  by  the  simple  technique  of  collapsing  the 
quadrilateral  elements  into  triangular  elements  around  the  crack  tip  and 
placing  the  two  mid-side  nodes  of  each  side  of  the  triangles  at  1/9  and 
4/9  of  the  length  of  the  side  from  the  tip.  This  is  analogous  to  placing 
the  mid-side  nodes  at  quarter  points  in  the  vicinity  of  the  crack  tip  for 
the  quadratic  isoparametric  elements. 

The  advantages  of  this  method  are  that  the  displacement  compatibility 
is  satisfied  throughout  the  region  and  that  there  is  no  need  of  special 
crack  tip  elements.  The  stress  intensity  factors  can  be  accurately 
obtained  by  using  general  purpose  programs  having  isoparametric  elements 
such  as  NASTRAN. 

1.  INTRODUCTION.  The  direct  application  of  the  finite  element  method 
to  crack  problems  was  studied  by  a number  of  investigators  [1-3].  No 
special  attention  was  given  to  the  singular  nature  of  stress  and  strain 
at  the  crack  tip.  Because  of  the  large  strain  gradients  in  the  vicinity 
of  a crack  tip,  it  requires  the  use  of  an  extremely  fine  element  grid  near 
the  crack  tip.  By  comparing  the  finite  element  result  of  displacement 
components  or  stress  components  at  a nodal  point  with  the  corresponding 
asymptotic  result  of  displacement  or  stress  components  at  that  node,  the 
stress  intensity  factor  could  be  estimated.  The  estimated  value  of  stress 
Intensity  factor  varies  over  a considerable  range,  depending  on  which  node 
is  taken  for  computation.  This  results  in  poor  estimates  if  displacements 
are  taken  at  nodal  points  either  very  close  to  or  far  away  from  the  crack 
tip. 


An  improved  finite  element  technique  was  developed  by  Wilson  [4] . It 
combined  the  asymptotic  expansion  of  displacements  in  a small  circular  core 
region  surrounding  a crack  tip  and  the  finite  element  approximation  outside 
a polygon  approximating  the  circular  arc  of  the  core  region.  The  displace- 
ment fields  obtained  from  these  two  approximations  are  not,  in  general, 
continuous  along  the  asymptotic  expansion-finite  element  interface  except 
at  discrete  nodal  points. 

An  alternative  finite  element  approach  to  crack  problems  is  the  use  of 
special  elements  in  the  region  of  the  crack  tip,  e.g.  [5-7].  In  [5]  Tracey 
employs  quadrilateral  isoparametric  elements  which  become  triangular  around 
the  crack  tip.  The  displacement  functions  of  the  two  types  of  elements  are 
selected  such  that  displacements  are  continuous  everywhere  and  the  near  tip 
displacements  are  proportional  to  the  square  root  of  the  distance  from  the 
crack  tip. 
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Henshall  and  Shaw  [8]  and  Barsoum  [9]  showed  that  special  crack  tip 
elements  were  unnecessary.  For  two-dimensional  8-node  quadrilateral  ele- 
ments, the  inverse  square  root  singularity  of  the  strain  field  at  the 
crack  tip  is  obtained  by  collapsing  quadrilateral  elements  into  triangular 
elements  and  placing  the  mid-side  nodes  at  quarter-points  from  the  tip. 

The  quarter-point  quadratic  isoparametric  elements  as  singular  elements 
for  crack  problems  have  been  implemented  in  NASTRAN  by  Hussain  et  al  [10]. 

In  order  to  reduce  the  computer  core  requirement  and  to  simplify  the 
modeling  of  a structure,  better  known  but  lower  order  finite  elements  have 
been  abandoned  in  favor  of  cubic  12-node  isoparametric  quadrilateral  ele- 
ments as  described  by  Zienkiewicz  [11].  In  this  paper,  the  concept  of 
quarter-point  quadratic  isoparametric  element  is  extended  to  12-node  cubic 
isoparametric  elements.  The  correct  order  of  strain  singularity  at  the 
crack  tip  is  achieved  in  a simple  manner  by  collapsing  the  quadrilateral 
elements  into  triangular  elements  and  by  placing  the  two  middle  nodes  of 
a side  at  1/9  and  4/9  of  the  length  of  the  side  from  the  tip.  The  12- 
node  isoparametric  elements  have  been  implemented  in  NASTRAN.  Both  mode 
I and  mixed  mode  crack  problems  are  computed  by  NASTRAN  using  the 
collapsed  elements  to  assess  the  accuracy.  The  stability  of  results  is 
discussed  when  the  collapsed  triangular  elements  are  used. 


2.  THE  12-NODE  QUADRILATERAL  ISOPARAMETRIC  ELEMEm'.  A typical  12- 
node  quadrilateral  element  in  Cartesian  coordinates  (x,y)  which  is  mapped 
to  a square  in  the  curvilinear  space  (5*n)  with  vertices  at  (±1,  ±1)  is 
shown  in  Figure  1.  The  assumption  for  displacement  components  takes  the 
form: 


12 

“ “ I Ni(C,n)Ui 

i=l 


12 

V - X Ni(C,Ti)v, 

i-1 


(1) 


where  u,v  are  x,y  components  of  displacement  of  a point  whose  natural  coor- 
dinates are  5,n;  Ui,Vj^  are  displacement  components  of  node  i and  Ni(C,n)  is 
the  shape  function  which  is  given  by  [11] 


NiU.n)  - jfe  * Tinpi-io  * * n*)][-io  ♦ 9C5*  ♦ n*)] 


♦ ^ (1  ♦ Cei)(i  + 9nni)(i  - n*)Ci  - n?) 


♦ ^ (1  + nni)(i  ♦ 95Ci)Ci  - 5*)(i  - 


(2) 


I 


I 


for  node  i whose  Cartesian  and  curvilinear  coordinates  are  and 

(5i,ni)  respectively.  The  details  of  the  shape  functions  and  the  numbering 
sequence  are  given  in  Figure  1. 

The  same  shape  functions  are  used  for  the  transformation  of  coordinates, 
hence  the  name  isoparametric, 

12 

X • I N, (5,n)x, 
i-1 

12 

y - I Nj(5,n)yi 

i-1 

The  element  stiffness  matrix  is  found  in  the  usual  way  and  is  given 
by  [9510] 


[K]  - [B]'^[D][B]  det  |Jld5dn 

-1  -1 

where  [B]  is  a matrix  relating  joint  displacements  to  strain  field 


(4) 


[B]  — [ • • • B^  * • • . ] , 


[Bil  = 


dNj 

0 

dx 

0 

By 

BNj  BNj 

9y  3x 


(5a) 


and  [D]  is  the  material  stiffness  matrix  and  is  given  for  the  case  of  plane 
stress  by 


[D] 


1 - v”* 


1 V 0 

V 1 0 

0 0 (1  - v)/2 


(5b) 


in  which  E is  Young's  modulus  and  V is  Poisson's  ratio. 
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The  Jacobian  matrix  [J]  is  given  by 


[J] 


"ax 

^1 

3Ni 

• • 

35 

35 

• • • • • • 

35 

• • 

■ 

3Ni 

*i  Xi 
• • 

_3q 

3n_ 

L 3n  J 

• • 

(6) 


whenever  the  determinant  of  [J]  is  zero,  the  stresses  and  strains  become 
singular  [8-10].  The  derivatives  of  shape  functions  are 

It  " ^ * nni)[-io  ♦ 9(5?  + np](-i05i  ♦ I85  ♦ 275^5*  ♦ gCin*) 

81 


♦ ^^i^i  * ■ n*)(i  - n|) 


81 


^ (1  + rnii) (1  - 5?) (95^  - 25  - 275^5*) 


dN 

3^  = 2^6  * CCi)[-io  + 9(5*  + n*)](-ioni  ♦ I8n  + 27nin*  ♦ 9ni5*) 


-^riiCi  - 955i)(i  - 5*)(i  - Kl) 

♦ ^ (1  + 5Ci)  (1  - n?)  (9n.  - 2ti  - 27nin*) 


(7a) 


(7b) 


3.  THE  CRACK  TIP  ELEMENT.  In  an  8-node  quadratic  isoparametric  element, 
Henshell  and  Shaw  [8]  and  Barsoum  [9]  found  independently  that  the  strain 
became  singular  at  the  comer  node  if  the  mid-side  nodes  were  placed  at  the 
quarter  points  of  the  sides  from  the  comer  node.  This  singularity  is 
achieved  in  a similar  way  for  a 12-node  isoparametric  element  by  placing  the 
two  middle  nodes  at  the  1/9  and  4/9  of  the  length  of  the  sides  from  the 
common  node  of  two  sides. 

For  sin^licity,  let  us  consider  the  singularity  along  the  side  q » -1 
of  Figure  1.  In  general  the  cubic  mapping  functions  are 

X - Sq  ♦ aj5  ♦ a25*  ♦ 835*  (8) 

u - b.  ♦ b,5  ♦ b25*  ♦ b,5*  (9) 


•5 


A 
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For  5 » -1,  -1/3,  1/3  and  1,  the  corresponding  values  of  x and  u are 

X ■ 0,  ai,  I 
u « Uj,  U2,  Uj, 

The  constants  a's  and  b's  in  terms  of  these  values  of  x and  u are 


*0  = 16 


^ (-1  + 9a  + 93)  , »!  = (-1  - 27a  + 273) 


82  = II  (1  - a - 3)  . 83  = II  (1  + 3a  j 33) 


) (10) 


jf-  (-U,  + 9u-  + 9u,  - u.)  , b,  = TT-  (u,  - 27u,  + 270,  - u.) 


^2  = ('‘l  - “2  - U3  + U4)  . bj  = ^ {-Ui  + 3u2  - 3U3  + U4) 


(11) 


dx 


To  have  singular  strain  at  x = 0 (C  = -1).  the  reduced  Jacobian,  must 
vanish  at  5 = ■!•  From  (8)  we  have  ^ 


d5  “ “1 


ai  + 2a,C  + 3aT5^ 


(12) 


dx 


For  5 » -1,  ^ ® 0 leads  to  the  equation 


B = 2a  + g 


(13) 


du 


In  order  to  have  the  inverse  square  root  singularity  for  ^ , 


E ■ I f ■ ‘‘l  * 


dC 


X must  be  a quadratic  function  of  5 so  that  the  inverse  gives  ^ as  a function 
of  x^'2.  This  leads  to  33  = 0 or 


1 ♦ 3a  - 33  = 0 
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(14) 


The  solution  of  (13)  and  (14)  gives 


0=1/9  and  6 = 4/9  (15) 

Equations  (8)  and  (9)  become 


X = f (1  + a" 

or 

C = -1  + 2g 

(16) 

u = Ui  . i 

(-llUj^  + I8U2  - Ouj  + 

2U4) 

1 

jx  1 (2uj  - 5u2  4u3  - 

U4)  f 

* f (-^1  " 3u2 

- 3^3 

(17) 

From  tl7)  it  is  clear  ^ has  singularity  of  the  order  — at  x = 0. 

The  inverse  square  root  singularity  at  x = 0 along  any  other  ray  emana- 

ting from  node  1 can  be  achieved  by  degenerating  the  quadrilateral  element 
into  a triangular  element  with  the  side  10,  11,  12,  1 collapsed  to  a point 
at  the  crack  tip  and  placing  grid  points  2,  9 at  1/9  and  3,  8 at  4£/9  from 
the  tip.  Figure  2,  where  £ is  the  length  of  the  sides  corresponding  to 
n = ± 1.  For  simplicity  and  without  loss  of  generality  we  take  3=0  and 

the  Cartesian  coordinates  of  nodal  points  as  follows: 


Node 

1 

2 

3 

4 

5 

6 

7 

8 

9 

x/£ 

0 

1 

9 

4 

9 

B 

2+cos  a 

3 

1+COS  Cl 

3 

COS  a 

4cos  a 

9 

cos  Cl 

9 

V/l 

0 

0 

0 

0 

sin  a 

3 

2sin  a 

3 

sin  a 

4sin  a 

9 

sin  a 

9 

Using  (3), 

x/J.  = N.  + Nn  cosa 

* ' (18) 

y/£  = Ng  sincx 

where 

"*  ■ I '*2  * I "3  * "4  * I "s  * I "6  ■ F 

(19) 

"b  ' I "s  * I "6  • N?  ‘ F «s  ‘ F "s,  ■ F ♦ 0 = (1  • n) 


1 


i 
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The  determinant  of  Jacobian  is 


I*  ^ 


3N.  dN  8N.  3Nn  ,2  , 

J ■ = ^ “ Tft  * 5)  sina  (20) 

3x  3^  35  3n  16 

'Sn 

This  shows  the  strain  is  singular  at  x = 0 (C  * -1)  along  any  ray  from 
X ■ 0 since  [j]  = 0 at  5 = -1  for  all  H-  From  (19)  and  (18)  and  using 
polar  coordinates  x/l  = pcos0,  y/J,  = psin6,  where  p = r/i,  we  obtain 


5 = -1  + 2p^  [cos(e  - i)/cos(^)] 


r,  = tan(0  - |.)/tan(|.)  j 

For  a more  general  collapsed  triangular  element  in  Figure  2 when  B 0, 
these  two  equations  take  the  forms 

111  - \ 

5 = -1  ♦ 2R^  , r2  = p2[cos(0  - 3L1-§.)/cos(SLI-1)]2 


n = tan(0  - ^)/tan(”  ^) 


The  displacements  components  u,v  at  a point  (5.n)  of  the  triangular 
element  of  Figure  2 are 

12 

u = Nj^(5.n)ui  = AQ(n,Ui)  + Aj(n,ui)(i  + Q 

♦ A2(n,u.)(i  + 5)^  + Ajdi.u^d  + a® 

12 

V = Z N^(5.n)Vi  = AQ(n,vj^)  + A (n,vj^)(i  + 5) 
i=l 

♦ A2(n,vj^)(i  + 5)®  + Aj(n,vj^)(i  + o® 

where 

AQ(n,Uj^)  ={2(-i  + 9n®)[(i  - n)ui  + (i  + ti)ujq] 

♦ 18(1  - n®)[(i  + 3n)u^j  + (1  - 3n)u^2]}/32 
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The  displacement  derivatives  are 


3u  _ + 1 

3x  35  3x  3ti  3x  (1  + 5)* 


. 3AQ(n,Uj) 


3A, 


* (1  + 5)  I [Aj  - 2(1  + n)^]  + 


3u  3u  35  3u  3t)  1 4[(1  + cosa)  - nCl  - cosa)l 

* fit-  — 


3y  35  3y  3n  3y 


(1  * Q‘ 


£sina 


BAqCti.Uj^)  1 
+ 


3n 


(1  ♦ 5) 


where 

^ ■ 54n^)ui  - (2  - 36n  - 54ti*)Ujq 

+ 18(3  - 2n  - 9n*)Ujj  - 18(3  + 2ti  - 9n*)Uj2 )/32 

Similar  expressions  for  3v/3x  and  3v/3y  with  replaced  by  Vj^. 

It  can  be  seen  that  both  3AQ(n»u.  )/3r|  and  3A»(n,vj^)/3n  vanish  for 
all  n if 

ui  - uio  = uji  = U12  and  vj  = v^j  = v^  = V12  C24) 


Hence  for  the  strain  field  to  have  the  inverse  square  root  of 
singularity  at  r = 0,  the  nodes  1,  10,  11,  12  which  are  collapsed  into 
one  point  must  be  tied  together.  This  is  analogous  to  the  constraints 
given  in  [12]  for  quadratic  isoparametric  elements.  Using  multiple 
constraint  conditions,  equations  (24),  the  displacement  components  at 
(5»n)  relative  to  the  tip  may  be  written  in  the  form 

I 

U - [36Fi(n,up  + FjCn.u^)  ♦ 36{F2(n,Ui)  - Fj(n,Ui)} 

1^ 

r2  - 36F2(n.Ui)Rl  (25) 

1_ 

V * f36Fj(n,Vj^)  + Fj(n,Vj)  ♦ 36{F2(n,v^)  - Fj(n,Vj^)} 

1_ 

r2  - 36F2(ti,v^)R]  (26) 
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(27) 


* 4.  DETERMINATION  OF  STRESS  INTENSITY  FACTORS.  The  collapsed  triangular 
elements  around  the  crack  tip  have  the  correct  order  of  singularity  at  the 
tip.  The  continuity  of  displacement  components  is  insured  throughout  the 
region  and  because  of  the  use  of  higher  order  polynomial  for  the  displace- 
ment field,  the  nodal  displacements  obtained  from  the  finite  element  method 
should  be  quite  accurate.  If  nodal  displacements  are  substituted  into  the 
left  hand  side  of  the  well  known  near  crack  tip  displacement  formula  [13]. 

n-1  n-1/2 

2Gu(0)  = I {(-1)  r ' [d  D (n,0)  + a A (n.0)] 
n=1.2....  2n-l  uj  2n-l  ul 


n n 


+ (-1)  r [d2  D Cn,0)  + 


(28) 


2Gv(0)  = I {(-l)"’^"'^^^[d2„.iDyl(n.0)  + a2^./y^(n.0)] 
n=l  ,2 , . . . 

the  coefficients  d's  and  a's  can  be  approximately  determined  by  a finite 
number  of  terms  on  the  right  hand  sides  of  (28)  and  (29).  The  stress 
intensity  factors  and  K2  are  related  to  di^  and  a^  by 


K,  = -6,/P:n  . K-  = -a,/Tir  (30) 


In  C28)  and  (29), 


Dui(n,0)  = (n  - l/2)cos(n  - y)©  - (k  + n - yDcosCn  - i)0 
Du2(n,0)  = ncos(n  - 2)0  - (<  + n ♦ l)ccsn0 

A^j(n.0)  = (n  - l/2)sin(n  - |)0  - (<  + n + l/2)sin(n  - 1/2)0 
Au2(n,0.)  = nsin(n  - 2)0  - (k  + n - l)sin0 

Dyj^(n,0)  = -(n  - 1/2) sin (n  - |-)0  - (k  - n + y)sin(n  - j)0 
~ -nsinCn  - 2)0  - (k  - n - l)sin  n0 

Ayi(n,0)  = (n  - y)cos(n  - |•)0  + (k  - n - l/2)cos(n  - 1/2)0 

Ay2(n»0)  = ncos(n  - 2)0  + (k  - n + l)cos  n0  (31) 

wh<;re 

((3  - V)/ (1  + V)  for  plane  stress 

3 - 4v  for  plane  strain  (32) 


There  are  a number  of  ways  to  estimate  the  stress  intensity  factors  from 
the  finite  element  displacements  near  a crack  tip.  On  the  right  hand  side  of 
(28)  and  (29),  we  may  retain  only  term  or  r^/^,  r,...  up  to  rP  terms, 

and  on  the  left  hand  side  we  may  use  u and  v actually  obtained  from  (25)  and 
(26)  or  only  the  part  of  u and  v which  correspond  to  r^' ^ term  in  (25) , (26) 
[14,15].  Detaj.led  discussions  are  given  in  [16].  The  simple,  yet  accurate 
way  to  obtain  the  stress  intensity  factors  is  the  use  of  v(ro,Tr)  and  v(rQ,  -tr) 
for  and  u(rQ,iT)  and  u(ro,  - t;)  for  K2  [16]. 


K,  K (-7T)  = - ^tro.-TT) 

1 1/2  ' I*- 


(K  + 1) 


1/2 


(K  ♦ 1) 


Ki(Tr)  + Ki(-tt) 


(33) 


L 
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K^Cn) 


_ 2G  u(to, IT) 
* 1) 


K2(-Tr) 


-f/SiT  2G  u(ro,-Tr) 

+ 1) 


>4 


K2(ir)  ♦ K2(-Tr) 
2 


(34) 


where  uCrQ.ir),  vCtq.tt)  are  rectangular  components  of  displacement  of  the  node 
at  r = rQ,  0 = ir  relative  to  the  node  at  the  crack  tip  referring  to  the  local 
coordinates  with  crack  tip  as  origin  and  the  crack  on  the  negative  x-axis. 
This  technique  gives  good  results  if  1%  of  the  crack  length  is  used  for  ro 
and  Z = 9Tq.  For  a mode  I crack,  is  given  by  either  K^Cir)  or  K]^{-ir), 

K2  is  zero.  For  a mode  II  crack,  Ki  is  zero  and  K2  is  given  by  either 
K2(Tr)  or  K2 (-it). 


5.  NASTRM  IMPLEMENTATION.  The  NASTRAN  implementation  of  the  12-node 

quadrilateral  follows  that  of  the  8-node  quadrilateral  as  described  in  [10]. 
The  dummy  user  element  facility  of  NASTRAN  is  used.  This  requires  coding 
routines  to  calculate  element  stiffness  matrices  and  stress  recovery 
computations.  Modifications  to  existing  NASTRAN  source  code  are  made  to 
provide  proper  output  formats  for  the  element.  Stress  intensity  factors 
for  mode  I and  II  are  calculated  using  equations  (33)  and  (34).  *A11  stiff- 

ness computations  are  performed  in  double  precision  while  stress  recovery 

is  performed  in  single  precision.  Element  stiffness  matrix  computation 
requires  10  seconds/element  on  an  IBM  360/44. 

6.  NUMERICAL  RESULTS.  Three  mode  I and  one  mixed  mode  crack  problems 
are  chosen  for  numerical  computation  of  stress  intensity  factors.  The 
geometries  and  loads  of  mode  I tension  test  specimens  are  given  in  Fig.  3. 
The  idealization  of  a half  of  the  single  edge  crack  is  shown  in  Fig.  4.  A 
similar  idealization  is  used  for  a quadrant  of  a center  crack  or  a double 
edge  crack.  Three  collapsed  triangular  elements  surrounding  a mode  I 
crack  tip  are  shown  in  Fig.  5.  Nodes  1 through  10  are  numbered  counter- 
clockwise similar  to  nodes  19  through  28  but  they  are  coincide  with  the 
crack  tip.  At  the  crack  tip,  the  multiple  constraint  conditions  given  by 
equations  (24)  are  either  applied  or  not  applied.  The  multiple  constraint 
has  little  effect  on  numerical  results  of  stress  intensity  factors  for  the 
test  problems  studied  here  (see  table  1).  This  is  probably  because  the 
differences  among  uj,  U2,...ujq  and  among  v^,  V2...,vig  are  very  small 

in  the  elastic  range  when  the  nodes  1,  2,..., 10  are  not  tied  together. 


For  each  specimen  a reference  value  of  Kj  is  used  for  normalization. 
For  a central  crack  with  a/b  = 0.4,  Kj  = 1.966  [17]  is  taken  as  exact. 

The  exact  value  for  the  double  edge  crack  is  Kj  = 2.00  [18],  and  for  the 
single  edge  crack  is  Kj  = 3.728  [19.  using  1 st  formula  for  F(a/b)  on 
page  2.11].  Table  1 gives  ratios  of  Kj  obtained  from  cubic  isoparametric 
elements  to  the  corresponding  exact  value  of  Kj  for  various  values  of  Tq/^ 
where  rp  is  the  distance  between  the  crack  tip  and  the  nearest  node  and  a 
is  the  crack  length. 


* 


Three-point  Gaussian  quadrature  is  normally  used  to  calculate  each  partial 
integration  of  the  double  integral  (4).  As  an  option,  four-point  Gaussian 
quadrature  may  be  used  instead. 
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TABLE  1 . Kj (NASTRAN) / Kj (EXACT) 


Center  Crack 
a/b  = 0.4,  H/b  = A 
Exact  Kj  = 1.966 


Double  Edge  Crack 
a/b  = 0.4,  H/b  = A 
Exact  = 2.00 


Single  Edge  Crack 
a/b  = 0.4,  H/b  = A 
Exact  Kj  = 3.728 


0, 

.01 

0. 

015 

0.02 

No 

Yes 

No 

Yes 

No 

Yes 

0.981 

1.013 

0.982 

0.999 

0.983 

0.994 

1.000 

1.021 

0.998 

1.007 

0.999 

1.002 

0.980 

1.003 

0.980 

0.991 

0.982 

0.988 

An  obliqued  edge  crack  in  a rectangular  panel  under  uniform  tension  is 
solved  by  Freese  using  modified  mapping  collocation  method  [20] . The  NASTRAN 
program  is  used  to  solve  the  combined  mode  I and  mode  II  crack  problem  using 
six  collapsed  triangular  elements  around  the  crack  tip  as  shovm  in  Figure  6. 
For  a 45®  edge  crack  with  a/b  = 0.4,  H/b  = 2.0,  Kj  and  K2,  from  readings  of 
Bowie's  graphs  (Figure  1 - 16(a)  and  (b)  of  [20]),  are  approximately  1.86 
and  0.88.  The  idealization  and  boundary  conditions  of  the  slant  edge  cracked 
panel  are  shown  in  Figure  7.  Numerical  results  of  Kj  and  K2  are  tabulated 
in  Table  2 for  Tq/sl  = 0.01  and  for  various  other  conditions.  Again  the 
multiple  constraint  conditions,  namely  uj  = U2  = . . . = Uj^g  and  Vj^  = Vo  = 

...  = vjg,  give  little  effect  on  values  of  Kj^  and  K2.  Numerical  results 
using  cubic  isoparametric  elements  for  the  test  problems  can  also  be  found 
in  [16,21]. 

TABLE  2.  Kj  AND  K2  FOR  45®  EDGE  CRACK  BY  NASTRAN 


i 


7.  THE  STABILITY  OF  COLLAPSED  TRIANGULAR  ELEMENTS.  In  a recent  report 
by  Hussain  and  Lorensen  [22],  it  was  found  that  a slight  perturbation  in 
placing  the  mid-side  node  opposite  to  the  crack  tip  for  a collapsed  8-node 
quadrilateral  element  led  to  unstable  results  in  stress  intensity  factors. 
This  iinstability  can  be  shown  in  the  collapsed  12-node  quadrilateral  element 
if  one  or  both  middle  nodes  of  the  side  opposite  to  the  crack  tip  been 
slightly  perturbed  from  their  nominal  positions. 

Let  node  5 be  perturbed  as  shown  in  Figure  8.  Denoting  the  perturbed 
quantities  with  an  asterick  we  have 


X-/1 


2 + COSO 


(38) 


* sina  . pi 

A general  point  (x,y)  given  by  equation  (18)  will  be  displaced  at 


x*/£  = (1  + 0*[(1  - n)  + (1  + n)cosa]  + e ^ (1  + Q(1  - n*)(i  - 3n)  (39) 


y*/i  = |-  (1  + Q^(l  + Ti)sino  + (1  + Q(1  - Ti^)(l  - 3n)  (40) 

Along  the  line  q = -1/3,  and  replace  y*  by  r sin©  in  (40)  we  have 


1 + 


K = 


3e' 

sina 


+ 


4sin0sina 

3e'^ 


(41) 


Since  (1  + C)  is  a common  factor  in  displacement  components,  equations  (25) 
and  (26),  it  is  seen  that  the  singularity  required,  for  the  crack  problems 
disappears  along  at  least  the 'ray  q = -1/3  in  the  collapsed  triangular  case. 


As  a numerical  example,  the  central  crack  tension  specimen  of  Figure  3 
(a)  is  again  used.  If  the  idealization  remained  the  same  as  shown  in 
Figure  4 except  that  the  collapsed  elements  of  Figure  5 were  replaced  by 
those  of  Figure  8b  (where  nodal  points  20,  21,  23,  24,  26  and  27  are  on  a 
circular  arc),  the  computed  stress  intensity  factor  changed  from  its  almost 
exact  value  = 1.962  to  = 1,421  (nearly  30%  error).  If  only  nodal 
points  26  and  27  were  perturbed  to  their  new  locations  of  Figure  8b,  the 
stress  intensity  factor  would  become  Kj  = 1,457  (a  26%  error). 


8.  CONCLUSIONS.  By  a simple  manner,  the  12-node  isoparametric 
elements  can  be  used  to  form  a singular  element  for  two-dimensional  elastic 
fracture  mechanics  analysis.  The  elements  have  been  successfully  implemented 
in  NASTRAN  which  can  now  be  more  efficiently  used  for  more  accurate  predic- 
tion of  stress  intensity  factors  of  complicated  crack  problems.  The 
middle  nodes  of  the  side  opposite  to  a crack  tip  in  a collapsed  triangular 
element  should  be  accurately  located  to  avoid  unstable  results.  The 
extension  of  collapsed  triangular  elements  as  singular  elements  to  three- 
dimensional  brick  elements  can  be  easily  done  as  in  [9,10]. 
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N?  = 32  (i^n)(i+5)[-io+9(5*+n")] 


= (l-n)Cl-C)[-10+9(5"+ri^)] 

= ± (i-n)(i-C"Ki+3C) 

= i (i-n)(i+C)[-io+9(c^Ti")] 
= ^ (i+C)(i-n"Hi-3nD 
= ^ aH)(i-n^Mi+3n) 


Ng  = (i+nKi-5^Ki+35) 

Ng  = ^ {i+n)(i-C")(i-3C) 

Nio  = jf  (i+n)(i-C)[-io+9(5"+n") 
Nil  “ ^ (i-C)(i-n')(i+3Ti) 

Ni2  = {1-Q(1-ti^)(1-3T1) 


Figure  1.  Shape  Functions  and  Numbering  Sequence  For  a 12-Node 
Quadrilateral  Element. 
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Figure  2.  A Normalized  Square  in  (C»n)  Plane  Mapped  Into  a Collapsed 
Triangular  Element  in  (x,y)  Plane  with  the  side  C = "1 
Degenerated  into  a Point  at  the  Crack  Tip. 
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Figure  5. 


Three  Collapsed  Triangular  Elements  Surrounding  a 
Mode  I Crack  Tip. 
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Figure  7.  Idealization 


Figure  8(a).  Node  5 Perturbed  to  5*. 

(b).  Nodes  20,  21,  23,  24,  26,  27  Perturbed  From 
Their  Nominal  Positions. 
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ABSTRACT.  Let  H he  a real  Hilbert  space  with  symmetric  inner  product 
< , > , and  let  g e H be  an  arbitrary  vector.  Upper  and  lower  bounds  of 
variational  type  are  presented  on  the  quantity  <g,^)  associated  with  the 
solution  ii  f H of  an  arbitrary  equation  F((i  = 0.  The  bounds  are  based  on  a 
bivariational  approximation  to  (g,(|i>,  namely 

J(H',4>)  = F4'>  + < g,4>>  , 

and  do  not  depend  on  any  decomposition  of  the  operator  F.  Applications  to  both 
linear  and  non-linear  problems  are  indicated. 

I.  INTRODUCTION.  Let  A be  a self-adjoint  operator  in  a real  Hilbert 
space  H with  inner  product  < , ) . Associated  with  the  linear  equation 

A(J>  = f (1) 

in  H is  the  well-known  Rayleigh-Ritz  variational  functional 

RC^)  = -(4',A4>>  + 2<f,4'>,  9 e H , (2) 

which  is  stationary  about 

R((t>)  = <f,(|)>  (3) 

for  variations  in  4’  around  41,  the  solution  of  (1).  If  A is  strictly 
positive,  so  that  for  all  ^ e H 

a<4’,4>)  ^ <4>,A4>)  ^ B > 0 , (4) 

then  variational  bounds 

RC*)  + -||a<J>  - f||^  < <f,(f,>  < R(4>)  + ^||a4>  - f||^  (5) 

a — — D 

* 
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then  instead  of  (5)  we  have  the  variational  bounds 


R(4>)  - i ||a*  - f 11^  < <£,*>  1 RC*)  + ^ Ha*  - f 11^  - (7) 

Results  like  (5)  and  (7)  are  useful  when  ^ cannot  be  found  exactly,  but 
upper  and  lower  bounds  on  the  quantity  <f,4i>  are  required.  Other  variational 
bounds  can  be  derived  which  depend  on  decomposing  all  or  part  of  the  linear 
operator  A in  the  form  T*T  such  decompositions  are  not  always 

available,  or  convenient  (e.g.  A might  represent  an  integral  operator,  or  an 
unwieldy  differential  operator;  or  equation  (1)  might  stand  for  a batch  of 
simultaneous  equations) . For  simplicity  and  generality  we  do  not  consider  them 
here . 

The  question  arises:  can  we  find  corresponding  bounds  on  <g,^)  for 
arbitrary  g e H? 


II.  LINEAR  PROBLEMS. 


(i)  A self-adjoint 

Consider  the  pair  of  equations  in  H 

Atj)  = f , A\ii  = g 

for  which 

< g,(^>  = <f  ,*>  . 

Since  for  any  parameters  s and  t we  have  the  identity 

< sf  + tg,S((i  + tt())  - ( sf  - tg,s*  - ti)/>  = 4st(g,(|i> 


(8) 


(9) 


(10) 


appropriate  subtraction  of  two  pairs  of  bounds  like  those  in  (5)  for  each  of  the 
combined  equations 


ACs*  ± ti(/)  = sf  ± tg 


(11) 


leads  to  bounds  on  (g,*).  If  A satisfies  (4),  and  the  ratio  s:t  is 
optimized,  we  obtain  the  result 


j + Cs  - nc  <^  < g,4i  > < J + Cs  + nc 


(12) 


with 
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y 


A4i>  + + (g.t’}  = <g,4,>  - <6lJ/,A6i(,>  , 

S(y,4)  = (A*  - f,A'i'  - g>  = <A64i,A6iJ;>  , 

CC-i-.t)  = ||a$  - f||  ||Ay-  g||  = |1a6^||  ||a6K;||  , 

6(Ji  = 4>  - 6ij<  = y - ij)  , 


(13) 

(14) 

(15) 

(16) 


and 


^ 2 B a^'  2^6”a^’ 


(17) 


The  bounds  in  (12)  are  bivariational  in  character,  and  reduce  to  (5)  when  V = 4> 
and  f = g.  If  A satisfies  (6)  rather  than  (4),  we  ta)te  a = -y,  6 = +Y 
in  the  foregoing,  to  generalize  (7) . 


The  functional  J('!',<1')  is  a bivariational  approximation  to  and 

it  generalizes  the  Rayleigh-Ritz  variational  functional  R(4i). 


(ii)  A not  self-adjoint 


When  A is  not  self-adjoint,  the  bivariational  approximation  J(y,4') 
is  still  available,  but  is  now  associated  with  the  pair  of  equations 


A(ti  = f,  A*ij;  = g , 


(18) 


A*  denoting  the  Hilbert  space  adjoint  of  A.  If  A satisfies  condition  (6) , 
it  can  be  shown  [4]  that  the  bivariational  bounds 


J--C<<g,<ti><J  + -C 


(19) 


hold,  with 

C('f,4')  = ||a4>  - fll  - g||  = ||a6().||  ||a*6i*/||  . 

Rather  better  bounds  can  be  derived  whenever 
1 


(20) 


<<1>,(A  + A*)*)  > 6<4>,4>>,  6 > 0,  for  all  0 e , 


(21) 


i.e.,  whenever  the  self-adjoint  part  of  A is  strictly  positive.  Then  it  can 
be  shown  [5]  that 


J-^  (C-S)  + (C  + S) 


(22) 


with 


S(H',4')  = < A4>  - f ,A*'l'  - g> 


(23) 
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The  different  bounds  in  (12) , (19)  and  (22)  are  all  obtained  by  adding  to 
or  subtracting  from  the  bivariational  functional  J appropriate  "correcting" 
functionals,  which  are  themselves  bivariational  approximations  to  zero.  Other 
techniques  - 7^]  depend  on  constraining  the  vectors  ♦ and  so  that  the 
second  order  term  < 6\l/, a6^)  in  J taltes  a definite  sign.  Decompositions  of 
the  operator  A are  usually  involved. 

III.  NONLINEAR  PROBLEMS 

The  idea  of  "correcting"  a bivariational  approximation  to  <g,(J)>  works 
for  arbitrary  nonlinear  problems  of  form 


F()i  = 0 


(24) 


in  ff,  provided  that  F satisfies  reasonable  conditions.  The  functional 

J(1',4>)  = F4>>  + ( g,4>>  (25) 

generalizes  that  in  (13) , and  is  associated  with  the  pair  of  equations 

F(()  = 0,  F'  ((P)*il)  = g , (26) 

F'  (^)  being  the  linear  Gateaux  derivative  of  F at  (Ji.  Regarding  JCl'.t)  as 
a mapping  from  H ^ H into  the  reals,  its  Gateaux  derivative  at  (y,#)  is 
described  by  the  mosaic 


J 


_F$ 

- F' 


(27) 


and  thus  will  be  the  null  operator  when  (’l',4>)  = (i(»,((i),  the  solutions  of  (26). 
The  true  variational  character  of  J(’l',$)  is  thus  established.  The  original 
problem  (24)  is  embedded  into  the  larger  problem  (26) , with  S'  playing  the  role 
of  a sort  of  Lagrange  multiplier.  Full  details  of  the  analysis  are  given  in  (8). 


It  is  interesting  to  note  that,  if  S'  can  be  chosen  in  terms  of  C*  so  that 


F’  (4')*S'  = g , 


(28) 


then  formally 

J = <g,{4i  - F' (4>)"^F«>}>  = <g,N4'>  , (29) 

N#  being  Newton's  approximation  to  (|i.  The  functional  J thus  generalizes 
Newton's  approximation  in  a certain  sense,  and  it  does  not  demand  knowledge  of 
unpleasant  inverses. 

To  establish  bivariational  bounds  on  <g,4>)  in  the  form 

J-C<(g,())><J  + C,  (30) 
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for  all 


*’l'*2  ‘ ^ ^ ^ ^ V C jrC  D(K)  n D(F’*)  , (33) 

Then  it  is  straightforward  (see  (8))  to  show  that  a suitable  'correcting' 
functional  is 

C(y,*)  = i |1f*||  ||f'  (4>)*'»'  - g||  + -^2  K(y)  1|f4«||^  . (34) 

2c 

We  note  that  condition  (31)  is  required  even  if  F is  linear  (cf.  (6)). 

It  implies  uniqueness  of  ^ in  S,  and  holds  for  example  when  F4>  = - 41 

where  G is  a contraction  operator,  or  when  F^i  = A4i  + f(<ti)  with  A self- 
adjoint  and  ]x>unded  as  in  (4),  and  df/d(^  suitably  bounded.  Condition  (32) 
indicates  that  the  nonlinearity  of  F is  not  too  fierce.  It  is  satisfied  if 
either 

|(f4j^  - F*^  - F' (4^)  - 4>2]  II  1 I - *211^  (with  KC^)  = )c|jy||)  (35) 

or  (for  function-spaces  with  suitedsle  innerproducts) 

|f4.  - F4)  - F'  (4>  ) [4>  - 4>,)  I < i )c'*’|4>,  - 4>_|^  (with  K('i')  = lt'''|'F|  ) . (36) 

1 2 112'—  2'12'  ''  sup 

In  a practical  situation,  the  cirtical  tas)c  can  be  to  find  a suitable  subset 
S of  the  domain  of  F for  which  the  conditions  (31)  and  (32)  hold  good. 


IV.  ILLUSTRATIVE  EXAMPLES 


Reactor  problems 


The  different  bounds  for  linear  problems  can  be  illustrated  by  reference 
to  neutron  diffusion  in  a reactor,  described  by  the  equations 


2„2. 


- o‘'V‘-)4(r,t)  +11;=  f (r,t) , 0<t<T,  rev. 


(37) 


i(r,t)  =0  on  3V  for  all  t,  4(^,0)  = 0 for  all  r 


The  total  number;  of  neutrons  absorbed  in  time  T is  proportional  to  <1,4^ 
where 


T 

< ♦ ,♦  > = / / 4>  (r,t)*  (£,t)d£dt 

0 V 


(38) 
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In  a steady-state  situation,  bounds  on  ( 1,4>)  can  be  found  from  the  simple 
Rayleigh-Ritz  approach  in  I if  f (r ) is  constant  (cf.  [£] ) ; otherwise  the 
theory  of  II  (i)  can  be  used  when  f (r)  is  varying.  In  a time-dependent 
situation,  the  theory  of  II (ii)  applies  with  (38)  as  inner  product  (see  (4]). 

Pointwise  bounds  on  solutions 


By  taking  g as  a suitad>le  kernel  function,  it  is  often  possible  to  use 
bivariational  theory  to  obtain  pointwise  bounds  on  solutions.  For  example,  if 
equation  (1)  represents  the  Fredholm  integral  equation 

b 

<(>(x)  + A / k(x,y)(()(y)dy  = f (x)  , (39) 

a 


and  if  g(x)  = k(x',x),  bounds  on  (k,4i)  lead  to  bounds  on  4i(x')  • If 

k is  symmetric,  the  auxiliary  equation  in  (8)  is  actually  the  one  specifying 
the  reciprocal  kernel,  and  interesting  theoretical  developments  ensue.  The 
bivariational  method  is  a simple  yet  powerful  practical  tool  in  this  situation  (9)  . 

The  same  kind  of  approach  can  be  used  for  differential  equations,  with  g 
as  the  Green's  Function  for  a suitably  simple  part  of  the 
operator  [10] . 

Nonlinear  problems 

The  nonlinear  diffusion  equation 

1 2 

- — I + (1  + (|>)  + ^ (1  + (Ji)  =0,  -1  < X 

dx 

<t>(-l)  = 0,  (KD  = 0 

is  discussed  in  (£]  , and  bounds  on  <l,4i)  are  evaluated, 
be,  for  exctmple,  the  amount  of  heat  stored  in  a metal  bar 

temperature  zero  which  was  suffering  a nonlinear  heat-loss  to  the  surroundings 
at  temperature  -1.  A monotonicity  theorem  can  be  employed  to  determine  a suit- 
able subset  5 for  the  determination  of  the  critical  constants  in  the  'correct- 
ing' functional  C. 


differential 


< 1, 


(40) 


This  quantity  could 
with  ends  kept  at 


The  nonlinear  integral  equation 


7r/2 

/ 


0 


sin(x  - y) 
w(x  - y) 


♦ (y)dy 


1 

i|>  (x) 


(41) 


occurs  in  communication  theory,  and  the  cosine  transform  (cos  px,4i(x)>  of 
(x)  is  proportional  to  the  signal  strength.  Bivariational  bounds  on  this 
quantity  are  reported  in  [8] . 


Applications  to  wide  classes  of  problems  are  evidently  feasible,  and  much 
development  remains  to  be  done. 


V.  SOME  GENERALIZATIONS.  Instead  of  {g,4i)  in  the  foregoing,  we  can 
work  with  a more  general  inner  product  g((Ji).  The  auxiliary  equation  in  (26) 
must  be  replaced  by 

F'  (({i)  = g'  (((,)  . (42) 

It  is  possible  to  construct  a suitable  correcting  functional  when  g(^)  has  a 
bounded  second  derivative. 

Adaptations  can  be  made  to  the  theory  to  take  explicit  account  of  boundary 
terms  occurring  in  the  specification  of  adjoint  operators  (cf.  [^“^1 ) • 
Generalization  to  complex  spaces  is  not  difficult  (see  [7^,  ^] ) , and  the  theory 
can  be  set  in  more  abstract  spaces  which  need  not  even  be  normed  (see  [12]). 

The  bivariational  method  has  proved  efficient  in  calculating  dynamic  polarizabil- 
ities for  two-electron  atoms  at  arbitrary  complex  frequencies  ( [13] ) . 

REFERENCES 

1.  B.  Noble  and  M.  J.  Sewell,  1972,  J.  Inst.  Maths.  Applies.  9,  123-193. 

2.  A.  M.  Arthurs,  1970,  Complementary  variational  principles,  Oxford  - 
Clarendon  Press. 

3.  P.  D.  Robinson,  1971,  in  Nonlinear  functional  analysis  and  applications, 
ed.  L.  B.  Rail,  Academic  Press,  507-576. 

4.  M.  F.  Barnsley  and  P.  D.  Robinson,  1976,  Proc . Roy.  Soc . Edinb.  75A,  9, 
109-118. 

5.  M.  J.  Sewell  and  B.  Noble,  1976,  General  estimates  for  linear  functionals 
in  nonlinear  problems,  MRC  Report  #1703. 

6.  W.  D.  Collins,  1976,  Dual  extremum  principles  for  dissipative  systems, 

MRC  Report  #1624. 

7.  P.  D.  Robinson,  1977,  New  variational  bounds  on  generalized  polarizabilities, 
MRC  Report  #1725. 

8.  M.  F.  Barnsley  and  P.  D.  Robinson,  1977,  Bivariational  bounds  for  non- 
linear problems,  J.  Inst.  Maths.  Applies.,  in  press. 

9.  P.  D.  Robinson,  1977,  Pointwise  bounds  on  solutions  of  Fredholm  integral 
equations,  MRC  Report  #1767. 

10.  M.  F.  Barnsley  and  P.  D.  Robinson,  1976,  J.  Math.  Anal.  Applies.  172-84. 

11.  M.  F.  Barnsley  and  G.  A.  Baker,  Jr.,  1976,  J.  Math.  Phys.  2J_,  1019-1027. 

12.  I.  Herrera  a,nd  M.  J.  Sewell,  1977,  Dual  extremum  principles  for  non- 
negative unsymmetric  operators,  MRC  Report  #1743. 

13.  F.  Weinhold  and  P.  D.  Robinson,  1977,  Bivariational  calculations  of  bounds 
for  complex-frequency  polarizabilities,  submitted  for  publication. 


189 


SOME  GENERIC  PROPERTIES  OF  A LOGIC  MODEL 
FOR  ANALYZING  HARDWARE  MAINTENANCE  AND  DESIGN  CONCEPTS* 

James  T.  Wong  William  L.  Andre 

Mathematician  Research  Engineer 

U.S.  Army  Air  Mobility  Research 
and  Development  Laboratory 
Ames  Research  Center 
Moffett  Field,  California 

ABSTRACT 

A mathematical  structure  for  diagnostic  logic  modeling  was  formulated, 
which  allows  the  intrinsic  properties  of  a complex  Logic  Model  to  be 
studied  in  an  abstract  setting.  As  a result,  it  was  found  that  a 
loop-free  Logic  Model  is  a partially  ordered  set  and  that  every 
permutation  of  the  elements  in  the  terminal  set  of  a finite  partially 
ordered  set  S partitions  S into  disjoint  subsets.  Based  on  these 
results,  it  was  deduced  that  the  minimum  number  of  test  points  required 
for  conclusive  detection  of  malfunctioning  components  for  a loop-free 
system  is  equal  to  the  number  of  elements  in  the  terminal  set;  this  set 
constitutes  the  optimal  choice  for  test  points.  Also,  it  was  established, 
for  each  permutation  of  the  elements  in  the  terminal  set,  a relative 
failure  diagnostic  strategy  was  defined  in  accordance  with  Bellman's 
Principle  of  Optimality.  Finally,  for  the  purpose  of  illustration,  some 
examples  are  given. 
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COMPUTER  GRAPHICS  IN  A PRODUCTION  ENVIRONMENT 


William  D.  Johnston 
Analysis  and  Computation  Division 
National  Range  Operations  Directorate 
US  Army  White  Sands  Missile  Range,  NM 


Abstract 

This  paper  examines  the  mathematical  and  software  techniques  required  for 
an  efficient,  high-production  graphics  system.  It  is  shown  how  appropriate 
design  of  both  software  and  mathematical  procedures  will  lessen  the  workload 
of  the  customer  and  reduce  throughput  time,  while  satisfying  all  of  the  pro- 
duction requirements.  Such  procedures  are  designed  to  isolate  the  causes  of 
irregularities  that  may  arise  during  the  computer  solution,  and  give  the  user 
explicit  directives  as  to  corrective  action.  Also  discussed  are  the  software 
and  operational  techniques  required  for  information  security  purposes  within 
the  graphics  system,  as  well  as  the  methods  of  software  acquisition  for  spe- 
cific areas  of  the  system. 

Although  primary  emphasis  is  on  mathematics  and  software,  as  applied  to 
production  graphics  systems,  this  paper  also  examines  the  following  areas: 

a.  The  classification  of  production  vs.  demand  graphics. 

b.  User  requirements  and  demands. 

c.  System  operational  procedures. 

d.  Hardware. 

e.  The  evolution  of  the  White  Sands  production  graphics  system--an 
existing  high-production  system  that  produces  15,000  plots  per  month. 


1.  Production  vs.  Demand  Graphics 


The  need  for  graphics  falls  generally  within  these  two  categories,  with 
each  type  fulfilling  specific  requirements.  Since  the  needs  of  the  organiza- 
tion will  dictate  whether  a production  system  is  desirable,  it  is  important 
to  distinguish  between  the  two  systems  and  define  their  characteristics. 

The  demand  (or  interactive)  graphics  system  has  in  recent  years  become 
an  extremely  powerful  and  highly  developed  tool  in  which  the  user  normally 
accesses  a real-time  video  terminal  from  which  he  can  produce  hard  copy  prints 
with  an  appropriate  plug-in  device.  When  equipped  with  or  attached  to  a 
processor  (micro  or  otherwise),  the  images  may  be  edited,  rotated,  or  other- 
wise modified  through  keyboard  or  light-pen  entry.  Demand  graphics  systems 
find  their  utility  in  research  and  development,  interactive  graphical 
analysis,  and  in  any  situation  where  the  user  requires  instant  presentations 
or  prints  in  relatively  small  quantities. 

Production  graphics  systems  on  the  other  hand  are  allied  with  the  need 
for  systems  which  can  generate  large  quantities  of  plots,  where  the  plot 
Itself  is  not  a tool  in  the  development  process,  but  a graphical  overview  of 
some  operation  or  event.  To  that  end,  the  plot  is  not  intended  to  be  the 
source  of  specific  samples  of  data,  and  it  is  neither  expected  nor  intended 
that  the  analyst  will  apply  a ruler  to  the  plot  to  extract  specific  and 
precise  data  from  it.  For  that  purpose  there  exists  the  detailed  point-by- 
point data  listing  generated  by  the  data  processing  equipment.  The  plot, 
meanwhile,  provides  the  analyst  with  an  overall  picture  of  the  event,  and 
quickly  reveals  trends,  disruptions  or  deviations,  and  the  probable  outcome 
of  the  event  being  plotted. 

The  need  for  a true  production  graphics  system  exists  anywhere  there 
is  a large  volume  of  data  being  generated  and  where  there  is  a need  for 
graphics  to  either  supplement,  or,  to  a limited  extent,  stand  in  for  the 
actual  data  listings.  In  these  cases,  it  is  desirable  that  the  plots  be 
generated  through  the  use  of  standardized  procedures  requiring  little  action 
on  the  part  of  the  initiating  party.  Furthermore,  the  graphics  system  itself 
should  draw  all  titles,  numerical  annotation,  and  other  legends  on  the  plots 
so  that  there  is  no  need  for  a clerical  post-processing  step.  In  other  words, 
when  the  plot  comes  out  of  the  graphics  system,  it  should  be  ready  for  imme- 
diate insertion  in  the  final  data  report. 

The  important  factor  in  deciding  whether  a production  system  will  be 
beneficial  to  a particular  organization  is  the  volume  of  graphics  to  be 
produced,  including  the  production  of  graphics  that  are  desired,  but  not 
possible  or  practical  under  the  existing  system.  If  the  operation  requires 
relatively  few  plots  (on  the  order,  say,  of  one  or  two  hundred  per  day),  or 
if  there  is  a need  for  interactive  analysis,  a small  demand  system  will 
probably  be  quite  effective.  Conversely,  an  organization  that  reduces  large 
volumes  of  data  and  produces  numerous  reports  containing  graphics  will  find 
the  efficiency  of  a production  system  very  attractive. 
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II.  User  Requirements  and  Demands 


The  customer's  demands  often  exceed  his  requirements,  and  it  is  the 
responsibility  of  the  production  manager  to  decide  which  services  can  and 
should  be  provided.  These  decisions  affect  both  hardware  and  software, 
and  can  be  intelligently  made  only  after  close  consultation  with  the  users 
themselves.  Obviously,  the  requirements  may  vary  widely  from  one  installation 
to  the  next,  and  specific  recommendations  for  one  operation  could  well  be 
partially  or  wholly  invalid  for  another.  Nevertheless,  there  are  several 
general  points  that  must  be  reconciled  between  the  production  manager  and  the 
graphics  customers. 

The  first  and  most  important  of  these  is  turn-around  time.  The  user 
invariably  wants  fast  turn-around,  and  indeed,  it  may  be  possible  to  provide 
finished  plots  in  a matter  of  hours,  or  even  minutes.  But  this  might  well 
require  keeping  an  equipment  operator  on  duty  at  all  times,  when  it  would 
actually  be  more  efficient  to  group  the  graphics  jobs  so  that  the  actual 
plotting  is  done  only  at  certain  regular  times.  Such  a system  generally 
guarantees  turn-around  on  some  basis  such  as  four,  eight,  twelve,  or  twenty- 
four  hours. 

In  actual  practice,  there  are  very  few  cases  which  require  turn-around 
in  less  than  twenty-four  hours,  through  the  user  may  think  otherwise.  The 
primary  method  to  reduce  the  customer  pressure  on  turn-around  it  to  provide 
him  with  quick-look  printer  plots  on  his  computer  listing.  To  be  effective, 
however,  the  printer  plots  must  be  of  sufficient  usefulness  to  satisfy  his 
curiosity  and  his  anxiety  for  the  next  few  hours,  until  the  actual  finished 
plots  are  returned.  Whether  this  very  important  requirement  is  met  depends 
upon  the  design  of  the  printer  plot  program,  which  in  turn  will  be  discussed 
in  more  detail  in  the  section  dealing  with  software. 

Another  frequent  demand  is  for  larger  plots  in  order  to  obtain  increased 
resolution.  As  was  mentioned  previously,  however,  the  objective  of  produc- 
tion graphics  is  not  to  provide  a source  for  specific  precise  data  elements. 
Furthermore,  Increased  plot  size  entails  increased  capital  outlay  for  equip- 
ment, as  well  as  increased  periodic  expenditures  for  consumables — not  to 
mention  the  logistical  annoyance  of  finding  a place  to  store  the  larger 
sheets  of  paper.  Almost  all  production  graphics  requirements  can  be  met 
with  plots  (or  series  of  plots)  on  standard  8 1/2  x 11  inch  paper,  which 
permit  direct  inclusion  in  the  published  report.  The  greatest  problem 
here  is  in  convincing  the  customer  of  this  fact,  especially  if  he  has  been 
accustomed  to  a larger  size  in  the  past.  Experience  shows  that  this  is 
nothing  more  than  a matter  of  conditioning,  and  if  the  user  can  be  convinced 
to  at  least  try  a standard  size  plot  for  a period  of  time,  he  will  come  to 
find  it  satisfactory  in  every  respect. 

The  desire  for  multicolor  graphics  is  another  aspect  that  is  more 
related  to  conditioning  than  to  actual  need.  There  are  certainly  many  cases 
where  it  is  useful  to  have  two  or  more  plot  traces  on  the  same  grid,  and 
there  must  be  some  way  to  uniquely  identify  each  one.  This  may  be  done  by 
changing  the  color  of  the  Individual  trace,  or  by  changing  its  mode  (i.e.. 
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by  drawing  a point  or  symbol  plot,  as  opposed  to  a line  plot).  While  the 
multicolor  plot  is  pleasing  from  an  aesthetic  point  of  view,  it  does  leave 
Lhe  door  open  fcr  error  on  the  part  of  the  equipment  operator,  and  in  the 
long  run  it  may  turn  out  to  be  completely  useless  if  it  must  be  reproduced 
in  black  and  white  within  a printed  report.  Consequently,  the  multicolor 
capability  is  not  necessarily*  as  useful  as  it  would  first  appear,  whereas 
the  single-color  multi-mode  plot  can  not  only  fully  satisfy  the  trace  identi- 
fication requirement,  but  is  actually  superior  for  reproduction  purposes. 


The  final  customer  requirement  to  be  mentioned  here  is  that  of  system 
capacity,  or  the  total  number  of  plots  that  the  system  is  capable  of  producing 
in  a given  period  of  time.  The  demands  on  graphics  systems  are  similar  to 
the  demands  on  all  data  processing  equipment  in  the  real  world,  in  that  any 
increase  in  capacity  will  automatically  be  met  (or  exceeded)  by  an  increase 
in  demand.  In  other  words,  capacity  breeds  demand.  This  is  not  meant  as  a 
humorous  aside,  but  as  a statement  of  a very  real  fact  that  computer  systems 
personnel  have  had  to  deal  with  for  many  years.  And  this  is  not  in  itself 
all  bad  either,  since  increased  graphics  capability  can  often  speed  up  the 
work  of  the  analysts  and  data  report  users  by  eliminating  some  of  the  need 
to  pore  over  long  data  listings.  In  any  case,  as  long  as  the  minimum  essential 
capacity  for  the  operation  is  met,  the  only  major  concern  beyond  that  is  to 
ensure  that  any  excess  capacity  of  the  system  is  not  so  great  that  it  results 
in  purely  wasteful  expansion  of  usage,  manpower,  and  materials. 


III.  Operational  Procedures 


Basic  operational  procedures  in  a production  graphics  system  must  satisfy 
a number  of  important  goals.  First,  of  course  they  must  satisfy  the  plotting 
requirements  of  the  customer,  but  they  must  do  so  in  an  efficient  manner  if 
high  production  is  to  be  achieved.  Some  of  the  areas  that  must  be  encompassed 
are  the  issues  of  on-line/off-line  operation,  security,  scheduling,  and 
reliability. 

The  choice  between  on-line  or  off-line  operation  of  graphics  hardware, 
as  an  operational  matter,  often  goes  hand-in-hand  with  the  customer's  interest 
in  turn-around  time.  The  customer,  of  course,  visualizes  greatly  improved 
turn-around  by  operating  the  equipment  on-line.  Without  a doubt,  this  mode 
has  its  advantages — generation  of  the  plots  at  the  time  of  the  original  com- 
puter reduction,  elimination  of  the  need  for  an  off  line  plot  tape,  and  even 
increased  speed  of  operation  of  the  graphics  hardware  in  many  cases.  All 
of  this,  but  at  what  cost?  The  most  damaging  aspect  is  that  the  most  expen- 
sive component  of  all — the  computer  itself — is  bound  by  the  mechanical  speed 
of  the  graphics  hardware  (even  to  a certain  extent  in  a multiprocessing 
system).  Additionally,  one  or  more  1/0  ports  may  be  continuously  tied  up, 
not  to  mention  the  fact  that  an  operator  must  be  on  call  at  al]  times. 

By  operating  the  graphics  equipment  off-line,  the  data  transfer  rate  is 
at  full  tape  speed,  and  the  tapes  themselves  can  be  grouped  so  that  operating 
personnel  for  the  graphics  equipment  need  to  be  available  only  during  certain 
specific  time  periods.  Consequently,  operating  personnel  requirements  are 
reduced  somewhat,  and  the  computer  mainframe  is  not  hobbled  by  the  compara- 
tively slow  speed  of  the  plotter  hardware.  Turn-around  can  still  be  provided 
on  some  satisfactory  pre-arranged  basis,  such  as  four,  eight,  twelve,  or 
twenty-four  hours.  The  optimum  scheduling  depends  on  the  volume  of  graphics 
generated,  the  operating  characteristics  of  the  hardware,  and  the  shift  hours 
of  the  operating  personnel;  but  this  usually  has  to  be  modified  somewhat 
to  take  into  account  customer  requirements  and  other  influencing  factors. 

Some  experimentation  is  to  be  expected  when  establishing  a new  production 
system. 

Probably  the  most  neglected  aspect  of  graphics  operations  is  that  of 
physical  and  Information  security,  and  graphics  security  often  takes  second 
place  to  other  areas  of  data  processing  security.  Possibly  this  is  due  to 
the  fact  that  the  investment  in  graphics  hardware  is  usually  small  in  compari- 
son to  the  remainder  of  the  ADP  equipment,  and  improper  use  of  materials  or 
services  is  not  as  likely.  Nevertheless,  physical  security  affects  informa- 
tion security,  and  classified  graphics  must  be  just  as  closely  guarded  as 
any  other  security  sensitive  information. 

The  most  straightforward  method  of  handling  the  problem  is  to  locate 
the  graphics  equipment  in  a physically  secure  area.  This  could  well  be  in 

the  same  room  as  the  computer  itself,  though  some  types  of  graphics  hardware  ^ 

emit  fumes  or  dust  which  would  preclude  such  an  arrangement. 
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The  customer,  on  the  other  hand,  is  sure  to  want  hands-on  access  to  the 
equipment.  This  must  be  carefully  avoided  in  order  to  prevent  interference 
with  the  operating  personnel,  and  also  to  prevent  possible  breaches  of 
security.  Hie  desire  to  be  present  during  the  plotting  process  will  be 
greatly  diminished  if  he  is  supplied  with  useful  quick-look  printer  plots, 
and  if  the  turn-around  time  is  reasonable  and  dependable. 

Once  physical  security  is  achieved,  the  job  of  information  security 
becomes  a matter  of  establishing  an  awareness  of  the  importance  of  the 
security  of  classified  graphics,  and  of  course  strict  accountability  and 
absolute  adlierence  to  those  procedures  established  by  the  installation 
security  office. 

Though  there  are  many  additional  operational  questions  that  must  be 
resolved  within  any  given  system,  the  last  matter  to  be  addressed  here  is 
that  of  reliability.  This  refers  not  only  to  hardware  reliability  which  is 
a maintenance  responsibility,  but  to  operational  reliability.  In  other  words, 
the  customer  must  be  able  to  reasonably  expect  that  his  plots  will  be  delivered 
to  him  within  the  established  schedule.  The  greatest  pitfall  is  failure 
to  maintain  appropriate  communications  with  the  hardware  maintenance  personnel. 
It  is  the  operator  who  will  discover  malfunctions,  and  he  should  immediately 
report  all  details  of  the  problem  to  the  maintenance  section.  A foolish,  but 
very  common  occurence  is  to  shut  the  machine  off  ard  expect  the  maintenance 
people  to  find  the  trouble  during  their  regular  calibration  or  PM  checks. 

The  odds  are  chat  they  won't  find  it,  and  no  amount  of  shifting  the  blame 
will  get  the  customer's  plots  to  him  any  faster. 

In  an  off-line  system  where  tapes  are  grouped  and  the  actual  plotting 
done  at  specific  times,  there  is  a temptation  during  slow  periods,  when 
only  a few  tapes  are  generated,  to  hold  them  over  until  the  next  scheduled 
plotting  session.  This  may  at  first  glance  appear  to  improve  efficiency,  but 
it  is  more  likely  the  case  that  the  operator  who  would  have  had  only  a little 
bit  to  do  will  then  simply  have  nothing  to  do.  Then  the  customer  will 
come  knocking  on  the  door  at  the  regular  time,  only  to  find  out  that  the 
turn-around  (which  he  probably  thought  was  too  long  in  the  first  place)  has 
been  increased  not  by  a heavy  workload  but,  incomprehensibly,  by  a light 
workload . 

On  the  other  hand,  total  inflexibility  can  be  just  as  disastrous.  If 
the  operations  manager  is  aware  of  an  upcoming  increase  in  production,  it  is 
his  duty  to  prepare  in  advance  by  scheduling  additional  periods  of  operating 
time. 
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IV.  Software 


A.  General  Requirements 

Software  is  a critical  component  in  the  production  graphics  systems, 
wherein  the  mathematician/programmer  has  an  opportunity  to  develop  software 
which  will  not  only  satisfy  all  of  the  production  and  user  requirements,  but 
will  also  reduce  throughput  time  and  lessen  the  customer's  workload.  One 
must  not  make  the  mistake,  however,  of  assuming  that  all  software  must  be 
developed  in-house  to  obtain  maximum  efficiency.  There  are,  for  example, 
certain  basic  routines  which  perform  such  mundane  tasks  as  buffering  and 
writing  the  output  tape,  drawing  alphanumeric  characters,  etc.,  which  are 
normally  supplied  by  the  hardware  manufacturer  at  minimal  (or  zero)  cost. 

These  routines  are  generally  quite  versatile  and  efficient  as  supplied,  and 
it  is  simply  not  possible  to  develop  substitutes  in-house  at  lower  cost. 

It  must  also  be  realized  that  not  all  types  of  graphics  software  are 
either  necessary  or  economically  justifiable  within  a given  system.  Probably 
the  best  example  of  this  is  in  the  area  of  three-dimensional  or  perspective 
graphics.  Most  every  organization  has  an  occasional  application  where  3-D 
capability  would  be  nice  to  have,  but  the  acquisition  or  development  of  such 
routines  is  costly  enough  that  the  economic  justification  in  terms  of  poten- 
tial benefit  should  be  carefully  scrutinized  before  commiting  funds  or 
resources. 

The  major  portion  of  the  software  for  a production  graphics  system 
must,  however,  be  developed  in  house,  and  though  it  must  in  certain  ways  be 
tailored  to  the  specific  needs  and  requirements  of  the  particular  installa- 
tion, there  are  many  features  which  are  essential  to  any  high-efficiency, 
high-production  system.  Software  design  is  heavily  influenced  by  the  fact 
that  often  the  customer  is  an  analyst/technician  with  little  or  no  background 
in  mathematics  or  programming,  and  as  such  he  has  a tendency  to  suspect  the 
operation  of  the  plot  program  before  suspecting  the  data  in  the  event  of  a 
malfunction. 

In  actual  practice,  most  abnormalities  are  in  the  data  itself,  and 
well-designed  graphics  software  will  reveal  this  to  the  user  on  the  listing 
generated  during  the  plot  run  on  the  computer.  Typical  examples  are:  (a) 
the  analyst  used  the  wrong  data  tape.  (b)  He  used  the  right  tape,  but  was 
looking  at  the  wrong  listing.  (c)  He  plotted  the  wrong  parameters.  (d)  The 
data  was  in  different  units  from  those  stated.  (e)  The  input  data  file  was 
blank. 

The  software,  then,  must  first  of  all  minimize  the  effort  required  by 
the  user  to  initiate  a run.  This  is  accomplished  by  providing  him  with 
clear  and  complete  documentation  beforehand,  and  by  incorporating  features 
in  the  program  that  minimize  his  workload  and  chance  for  error.  For  example, 
where  there  are  a number  of  possible  selections  for  a given  plot  option,  the 
most  commonly  used  selection  should  be  used  under  default  conditions.  In 
other  words,  the  mere  act  of  not  specifying  any  selection  should  be  understood 


to  mean  chat  the  "standard"  selection  is  to  be  used.  If  the  various  options 
are  to  be  specified  on  cards,  the  layout  should,  of  course,  be  orderly  and 
neat . 


Without  delving  into  the  innumerable  graphics  options  concerning  fonn 
and  style,  we  can  look  specifically  at  those  features  whose  availability 
can  either  diminish  the  possibility  for  error  initially,  or  isolate  the 
cause  of  an  irregularity  once  it  does  occur. 

The  first  of  these  is  the  matter  of  the  selection  of  the  miniinum  and 
maximum  plotting  limits.  The  specific  techniques  are  described  in  detail  in 
a following  section,  but  stated  briefly,  there  must  be  a system  which  will 
automatically  select  limits  and  increments  that  are  pleasing  to  the  eye, 
contain  all  of  the  data,  and  do  not  extend  unnecessarily  far  beyond  the 
limits  of  the  actual  data.  The  implementation  of  this  single  option  elimi- 
nates the  source  of  many  errors  by  doing  away  with  manual  scanning  of  data 
and  manual  calculation  of  appropriate  round  limits  and  increments.  There 
must,  of  course,  exist  the  ability  to  easily  override  the  option  in  order  to 
insert  specific  limits  when  desired. 

Graphics  labeling  and  numerical  annotation  of  axes  must  be  done  by  the 
plotter  as  a part  of  the  plotting  process.  Any  clerical  post-processing 
step  for  this  purpose  is  prone  to  error,  and  in  any  case  will  increase  the 
turn-arcund  time  dr.r.TiaCical  ly . Security  classification  labeling,  where 
appropriate,  is  also  ha.ndled  in  the  same  manner.  ’Wljen  classified  plots 
are  properly  marked  at  the  time  tliey  are  generated,  their  susceptibility 
to  mishandling  is  lessened,  and  the  overall  security  posture  is  improved. 

The  method  of  specification  by  the  user  of  headings  and  legends  may 
vary,  but  a catalog  or  table  of  standard  headings  might  be  maintained  where 
there  is  frequent  repetition.  The  appropriate  headings  would  then  be  intro- 
dueed  upon  recognition  of  a unique  project  number  of  other  code.  Numerical 
annotation,  on  the  other  hand,  should  always  be  generated  automatically  on 
the  basis  of  the  minimum  and  maximum  plotting  limits  in  use,  with  no  addi- 
tional action  or  entries  required  on  the  part  of  the  initiator. 

There  must  be  a degree  of  flexibility  iti  the  handling  of  input  data  if 
the  system  is  not  to  break  down  under  va:“ying  mission  requirements.  usually 
in  a production  grphics  system  there  will  be  some  standard  data  format  that 
will  be  used  for  the  bulk  of  the  pri'cessing,  but  there  remain  the  odd  cases 
th.'it  appear  from  time  to  time  in  totally  foreign  data  formats  that  require 
‘•('me  form  of  gr.iphics  processing.  One  approach  is  to  read  the  data,  rewrite 
it  in  standard  format,  then  read  it  again.  Much  of  this  1/0  can  be  avoided, 
thiiigh,  if  the  graphics  software  has  some  provision  for  direct  entry  of 
data.  This  raiglit  be  through  a subroutine  call,  or  through  a common  storage 
block.  With  such  a provision,  it  is  only  necessary  to  provide  a small 
routine  which  is  capable  of  reading  or  receiving  the  nonstandard  data,  and 
the  remainder  of  tlie  data  transfer  is  accomplished  tlirough  what  is  effectively 
cure-to-Lore  iiovemenl  . 

Many  graphit  t;  raa  1 f iinct  i tnis  results  from  the  fact  that  the  data  to  be 
plotted  is  not  in  the  units  of  measure  that  the  customer  believes  them  to  be. 
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In  some  cases  this  will  result  in  total  failure  of  the  job  to  produce  the 
required  plots,  whereas  other  cases  may  result  in  a built-in  bias  Lhrougliout 
the  plot.  The  first  situation  is  not  particularly  dangerous  since  it  is  quite 
clear  that  some  specification  must  have  been  in  error.  On  the  other  hand, 
the  second  case  generates  an  insidious  error  that  may  go  entirely  undetected 
(e.g.,  plotting  yards  instead  of  meters).  Unfortunately,  there  is  not  a lot 
that  we  can  do  to  prevent  the  customer's  carelessness  in  this  matter,  but  it 
is  possible  to  incorporate  into  the  graphics  software  a procedure  for  units 
conversion  of  at  least  the  more  common  measurement  units.  Conversion  factors 
would  normally  be  stored  table-wise  and  referenced  by  an  appropriate  code 
during  initialization.  Digressing  somewhat  to  a point  that  should  be  kept 
in  mind,  is  that  although  most  conversions  simply  require  a multiplicative 
operation  with  a single  factor,  there  are  a few  cases  that  require  an  addi- 
tional operation  with  an  additive  factor  (degrees  Farenheit  to  degrees 
Centigrade,  for  example).  The  implication  is  that  it  may  be  necessary  to 
maintain  two  tables--one  for  each  type  of  factor — and  this  may  be  a considera- 
tion in  terms  of  core  conservation  if  a great  number  of  conversions  are  to 
be  made  available.  Another  alternative  is  to  simply  make  provisions  to 
allow  the  user  to  enter  the  specific  factor(s)  himself,  doing  away  with  the 
need  for  conversion  factor  tables  within  the  graphics  software. 

It  has  already  been  stated  that  the  provision  of  quick-look  printer 
plots  is  essential  in  a production  graphics  operation.  There  are  a number 
of  good  reaons  for  inclusion  of  the  capability,  not  the  least  of  which  is  to 
reduce  customer  pressure  on  turn-around  time  by  temporarily  satisfying  his 
curiosity.  This  in  turn  makes  it  easier  to  deny  him  physical  access  to 
the  plotter  equipment,  where  we  want  to  maintain  high  security.  In  order 
for  the  printer  plots  to  accomplish  this,  they  must  he  informative,  complete, 
and  easy  to  read.  Also,  one  should  not  overlook  the  fact  that  there  are 
many  occasions  where  a high-resolution  plot  is  not  needed,  ana  a printer 
plot  is  all  that  the  customer  desires.  Consequently,  tlie  option  should  exist 
to  generate  only  the  printer  plots  wiien  so  requested,  without  having  to 
unnecessarily  generate  the  regular  plots. 

If  the  printer  plot  is  to  satisfy  the  user,  it  must  incorporate  a numher 
of  features  that  allow  it  to  convey  as  much  information  as  possible  at  the 
least  possible  cost  in  terms  of  processing  and  core  storage.  The  details  of 
these  features  are  further  described  in  Section  IV. C. 

As  to  be  expected,  irregularities  will  occur  from  time  to  time  in  the 
processing  of  graphics  data,  and  though  the  graphics  software  cannot  prevent 
them,  it  can  in  many  cases  provide  the  means  to  quickly  isolate  the  cause 
of  the  problem.  One  of  the  most  valuable,  techniques  in  this  respect  is  the 
accurate  accounting  of  all  data  points  processed.  I'liis  means  nothing  more 
than  accounting  for  all  points  examined,  and  classifying  them  as  to  wlietlier 
they  were  actually  plotted,  or  were  blank,  c'ff-srale,  or  otherwise  unusable. 

A summary  of  the  accounting  for  each  plot  is  tlieii  printed  on  the  customer's 
computer  listing,  or  on  the  plots  themselves  if  appropriate.  In  tlie  event 
of  a failure,  the  analyst  will  know  exactly  how  many  data  points  were  pro- 
vided to  the  plot  program,  and  what  t tie  i r disposition  was.  This  infoniiation 
will  often  point  directly  to  the  cause  of  an  unexiiected  malfunction.  The 
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beauty  of  this  feature  is  its  simplicity;  very  little  processing  time  or 
storage  is  required.  A sample  algorithm  is  included  in  a later  section. 


It  has  not  been  the  intent  here  to  enumerate  all  conceivable  options 
in  a graphics  program,  but  instead  to  describe  a few  of  the  more  important 
features  which  can  reduce  user  effort,  minimize  errors,  improve  throughput, 
and  aid  in  troubleshooting  those  problems  which  are  inevitable.  Nevertheless, 
all  plot  options  specified  by  the  user  should  be  carefully  checked  for  errors 
by  the  graphics  software  during  its  initialization  phase,  and  substitutions 
made  for  erroneous  entries  where  practical.  In  all  cases,  explicit  informative 
diagnostics  should  be  given. 


B.  Automatic  Selection  of  Minimum  and  Maximum  Plotting  Limits 


The  automated  selection  of  plotting  limits  can  save  the  analyst  con- 
siderable time  and  effort  that  would  otherwise  be  spent  carrying  out  this 
task  manually,  and  it  is  not  particularly  difficult  to  design  a routine 
which  can  efficiently  select  limits  which  are  aesthetically  pleasing,  which 
contain  all  of  the  data,  and  which  do  not  extend  unnecessarily  far  beyond 
the  limits  of  the  actual  data. 

To  begin  with,  the  raw  data  contains  a range  of  values,  and  there  will 
exist  some  absolute  minimum  and  maximum  value  for  both  the  ordinate  and 
abscissa.  When  it  comes  time  to  plot  the  data,  we  must  have  some  idea  of  its 
bounds  if  the  plot  is  to  be  meaningful.  The  retrieval  of  the  absolute  min 
and  max  values  can  be  achieved  by  guess,  by  manual  scanning  of  the  data,  or 
by  some  automated  process  accomplished  by  the  software.  Once  obtained, 
these  absolute  limits  are  usually  of  such  a nature  that  they  must  be  rounded 
in  order  for  the  plot  to  be  easily  readable. 

The  first  decision  to  be  made  is  the  manner  in  which  the  absolute 
limits  are  to  be  retrieved.  Manual  processing  is  out  of  the  question  in  a 
production  system,  but  there  are  still  several  automated  processes  to 
choose  from.  The  first  of  these  is  to  place  the  entire  data  set  in  core  and 
scan  it  for  the  min  and  max  values.  Obviously,  this  system  is  practical 
only  for  those  trivial  cases  where  the  data  set  consists  of  no  more  than 
a few  thousand  pairs  of  coordinates.  Since  no  production  graphics  systems 
can  tolerate  this  type  of  constraint,  it  must  be  discarded.  The  second 
method  is  to  read  through  the  entire  set  of  data  while  it  resides  on  tape 
or  mass  storage,  saving  and  updating  the  minimum  and  maximum  values  in  the 
process.  Once  accomplished,  the  graphics  program  will  have  to  reread  all 
of  the  data  to  carry  out  the  actual  plotting.  This  is  a functional  method 
which  may  be  the  only  alternative  in  some  cases,  but  it  is  wasteful  of  1/0 

effort  since  all  of  the  data  must  be  read  twice. 

The  ideal  method  of  handling  the  problem  is  to  have  the  progr.un  which 

initially  generates  the  data  save  and  update  the  min  and  max  values  as  each 

data  point  is  generated.  This  requires  a degree  of  cim  rdination  between 
the  parties  responsible  for  the  data  reduction  software  and  those  responsible 
for  the  graphics  software,  but  if  accomplished,  rereading  of  the  entire  data 
set  becomes  unnecessary  and  the  savings  in  I/O  processing  time  are  signifi- 
cant. 

Once  the  raw  or  absolute  minimum  and  maximum  values  are  known,  appropriate 
round  limits  can  be  selected  which  will  make  the  plot  more  readable. 

Naturally,  there  must  be  a simple  method  to  permit  the  user  to  insert  any 
limits  he  desires  (he  may  want  to  plot  only  a particular  segment  of  the 
data,  for  example),  but  the  graphics  software  must  also  have  within  it 
some  method  of  intelligently  altering  the  raw  min  and  max  values  such  that 
round  numbers  will  appear  at  the  extremities  of  the  axes,  as  well  as  at 
the  intermediate  increments.  In  other  words,  not  only  must  the  endpoints 
have  round  numbers  associated  with  them,  but  the  increment  per  unit  of  axis 
length  must  be  an  appropriate  round  number  also. 
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The  process  of  selecting  the  round  limits  and  increment  must  be 
optimized,  for  if  the  resulting  range  is  unnecessarily  broad  the  plot  will 
contain  a great  deal  of  blank  space  while  the  plot  trace  itself  is  overly 
compressed.  Needless  to  say,  limits  which  fail  to  include  all  of  the 
available  data  are  totally  unacceptable.  Finally,  the  routine  devised  for 
the  task  should  not  be  so  zealous  as  to  attempt  to  improve  upon  limits  which 
are  already  optimum,  as  frequently  occurs  when  data  is  generated  artificially 
during  simulations. 

The  general  form  of  an  appropriate  algorithm  is  shown  in  Figure  1,  and 
a Fortran  TV  subroutine  which  incorporates  all  of  the  desired  features  and 
is  usable  without  modification,  is  also  included.  Some  samples  of  limits 
generated  by  the  routine  are  included  immediately  following  the  program 
listing. 

Automated  selection  of  plotting  limits  can  be  the  salvation  of  the 
harried  data  analyst,  but  there  are  a few  minor  pitfalls  to  be  guarded  against. 
The  most  obvious  of  these  is  that  auto  limit  selection  can  cause  smooth  data  to 
appear  rough.  For  example,  a plot  depicting  a sequence  of  very  small  radar 
range  errors  would  automatically  be  expanded  to  fill  the  whole  page.  While 
the  data  is  displayed  accurately,  the  visual  impression  is  that  the  errors 
are  quite  large.  In  cases  like  these,  it  is  better  to  enter  standard  limits 
so  that  all  plots  of  the  same  type  will  be  uniform. 

A second  pitfall  is  that  a single  piece  of  erroneous  data,  if  it  is 
itself  a minimum  or  maximum,  can  upset  the  selection  process.  The  graphics 
software  has  no  way  of  detecting  such  an  occurrence,  and  must  assume  that 
it  has  been  provided  with  accurate  data.  Most  data  reduction  software 
provides  some  degree  of  editing  and  filtering,  however,  so  the  frequency 
of  this  type  of  problem  is  minimal.  When  it  does  occur,  the  analyst  has  no 
recourse  but  to  manually  select  more  appropriate  limits  and  rerun  the  plot. 

Another  minor  problem  arises  when  the  plot  is  to  have  a logarithmic 
scale,  and  either  negative  data  or  negative  limits  exist.  Whenever  log  plots 
are  used  and  a negative  limit  is  generated,  care  must  be  taken  not  to  attempt 
to  take  the  log  of  that  limit.  Also,  one  can  see  that  if  the  round  limits 
are  generated  on  the  basis  of  the  raw  data,  the  resulting  log  scale  usually 
will  not  meet  the  requirements  originally  set  forth.  The  most  practical 
procedure  is  to  first  check  the  raw  limits,  substituting  zero  for  a negative 
one  when  necessary.  Then  take  the  logs  of  these  raw  limits,  with  the  computed 
log  values  in  turn  being  supplied  as  input  to  the  round  limits  subroutine. 

This  gives  the  log  scale  on  the  axis  a more  acceptable  appearance,  and 
simultaneously  avoids  an  unresolvable  condition  within  the  log  function. 


Obtain  axis  length  and  raw 
minimum  and  maximum  values 


Calculate  the  increment  per  unit 
length,  based  on  the  raw  values 


Separate  the  raw  increment  into 
its  mantissa  and  characteristic 


Round  up  the  mantissa  to  a "standard"  value 


Recombine  the  rounded  mantissa  with  the 
characteristic  to  produce  the  round  increment 


Find  the  first  integral  multiple  of  the  round 
increment  which  is  below  the  raw  minimum, 
and  use  this  value  as  the  round  minimum 


Compute  the  round  maximum  by 
finding  the  product  of  the  axis  length 
and  the  round  increment,  and  adding 
this  value  to  the  round  minimum 
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C.  Quick-look  Printer  Plots 


Though  printer  lots  are  of  low  resolution  by  nature,  they  are  intended 
not  to  take  the  place  of  high-resolution  plots,  but  to  give  the  data  analyst 
something  to  look  at  while  the  final  plots  are  being  processed.  They  can 
serve  to  reduce  customer  pressure  on  tutu-around  time,  and  reduce  customer 
desire  to  be  physically  present  in  the  plotting  room.  But  if  they  are  to 
fully  satisfy  the  user,  the  printer  plots  must  be  the  same  in  every  detail 
possible  as  the  final  plot.  That  is,  the  printer  plot  must  be  fully  labeled, 
must  have  some  form  of  grid,  and  must  be  numerically  annotated  using  the 
same  limits. 

One  exception  to  this  is  that  in  a production  system,  the  physical  size 
of  the  printer  plots  remains  constant  (usually  being  designed  to  fill  one 
full  page),  regardless  of  variations  in  size  of  the  high-resolution  plots. 

The  reasons  are  two-fold:  First,  any  reduction  in  size  below  one  page 
will  further  reduce  the  resolution  to  an  unacceptable  level,  and  second, 
what  little  advantage  might  be  gained  by  expansion  is  more  than  offset  by 
the  increased  complexity  of  the  program.  The  only  significant  drawback  to 
constant-size  printer  plot  is  an  aesthetic  one:  Intermediate  numerical 
annotation  points  do  not  always  fall  at  the  same  increments  as  on  the 
high-resolution  plot.  Remember,  though,  that  the  information  conveyed  is 
the  same,  and  the  printer  plot's  purpose  is  only  to  serve  the  user  until 
the  final  plots  are  delivered  a few  hours  later. 

There  are  a great  many  printer  plot  styles  in  existence,  serving  a 
variety  of  special  purposes,  but  generally  speaking,  the  production  graphics 
system  should  confine  itself  to  a straightforward  rectangular  format.  The 
data  Itself  is  then  represented  by  appropriately  placed  symbols  within  the 
printed  grid.  Both  the  utility  and  simplicity  of  such  a system  are  highly 
attractive. 

It  should  become  clear  as  one  undertakes  the  design  of  printer  plot 
software  that  no  part  of  the  plot  can  be  printed  out  until  all  of  the  data 
has  been  processed,  since  there  is  no  way  of  knowing  until  completion 
whether  a given  line  of  print  contains  all  of  its  required  data  points. 

(There  are  exceptions,  but  the  program  must  be  able  to  plot  all  data — not 
just  the  exceptions.)  In  other  words,  the  entire  plot  must  be  stored  some- 
where for  the  duration  of  the  plotting  process.  This  does  not  mean  that 
all  of  the  data  must  be  stored,  but  that  an  array  of  print  characters  must 
be  maintained  and  updated  as  the  plotting  progresses. 

As  an  example,  assume  that  the  printer  plot  is  to  consist  of  a 
rectangle  that  measures  50  by  100  increments.  This  will  require  51  lines 
by  101  print  positions,  for  a total  array  of  5151  characters.  The  sim- 
plistic approach  would  be  to  set  aside  an  array  of  5151  words  and  update 
the  word  corresponding  to  the  plotted  point.  But  this  is  needlessly 
wasteful  of  core  storage  since  each  word  has  the  potential  of  holding 
several  characters.  If,  for  example,  each  word  can  hold  six  characters, 
then  each  line  of  101  characters  can  be  contained  in  17  words,  and  the  51 
lines  then  occupy  a total  of  only  867  words — a net  savings  of  4284  words. 
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This  method  does  require  character  manipulation  and  masking  operations 
by  the  software,  but  they  are  both  simple  and  straightforward,  lending  them- 
selves to  either  Boolean  functions,  or  better,  to  the  FLD  bit  manipulation 
function  where  available.  For  that  matter,  they  can  even  be  accomplished 
by  integer  division/multiplication  processes,  though  not  nearly  so 
conveniently. 

A representative  printer  plot  program  which  incorporates  all  of  the 
desired  features  is  included  at  the  end  of  this  section,  along  with  some 
samples  of  its  output.  While  some  may  find  application  for  the  program  in 
its  present  form,  the  actual  purpose  of  its  inclusion  here  is  simply  to 
demonstrate  a working  program  which  has  all  of  the  features  required  in  a 
production  graphics  operation.  This  sample  routine  was  prepared  for  use 
on  a UNIVAC  1108  with  36-bit  words  and  UNIVAC  character  codes,  so  use  on 
machines  with  different  word  lengths  or  different  character  codes  would 
require  appropriate  changes. 
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D.  Accounting  of  Data  Points  Processed 
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Inevitably,  some  graphics  jobs  will  not  produce  the  output  that  the 
customer  expected,  and  the  resourceful  graphics  progranioer  will  prepare 
in  advance  for  this  contingency  if  he  is  to  avoid  spending  many  of  his 
future  days  troubleshooting  other  people's  production  problems.  Since 
most  production  graphics  problems  are  in  fact  caused  by  the  customer's 
inattention  to  the  input  data,  what  we  would  like  to  do  is  give  him  infor- 
mation with  every  plot  that  will  allow  him  to  discover  the  cause  of  the 
malfunction  before  he  resorts  to  contacting  either  the  software  develop- 
ment personnel  or  the  operations  personnel. 

The  single  most  valuable  tool  in  this  respect  is  the  simple  matter  of 
accurately  accounting  for  all  data  points  processed  by  the  graphics  soft- 
ware. This  usually  consists  of  nothing  more  than  a summary  of  the  total 
number  of  points  examined,  along  with  a breakdown  of  their  disposition  as 
to  whether  they  were  actually  plotted,  were  off  scale,  and  so  forth.  If, 
for  example,  the  user  receives  a blank  plot,  the  summary  can  immediately 
tell  him  whether  the  expected  data  was  even  presented  to  the  plot  program 
in  the  first  place,  and  if  it  was,  the  summary  will  indicate  the  probable 
cause  of  its  failure  to  be  plotted. 

It  is  useful  to  classify  the  status  of  data  points  according  to  how 
they  are  handled  by  the  plot  program.  A sampling  of  some  commonly  used 
classifications  are: 

Blank — A data  point  that  has  been  replaced  by  a blank  or  transparent 
word  that  serves  as  filler  where  the  particular  parameter 
does  not  exist  because  it  was  not  available  or  was  not  recorded. 

Off-scale — A data  point  which  has  either  or  both  coordinates  lying 
outside  the  limits  specified  by  the  extremities  of  the  axes. 

Outside  start  or  end  times--Where  the  option  exists  to  specify  further 
restricting  limits  (usually  based  on  some  interval  of  time),  a 
data  point  that  is  on-scale,  but  not  within  the  start  or  end 
time  limits. 

Thinned-out--A  data  point  that  passes  all  of  the  above  tests,  but  is 
thinned  out  to  reduce  the  density  of  the  plotted  points. 

An  examination  of  the  characteristics  of  these  various  conditions  will 
show  that  they  should  be  tested  in  the  order  listed,  in  order  to  provide 
the  most  meaningful  information  to  the  analyst.  Some  categories  can  be 
broken  down  even  further,  such  as  off-scale  data  points  which  could  be  off 
in  X,  Y,  both  X and  Y,  and  so  forth,  but  this  is  usually  not  necessary. 
Figure  2 shows  a practical  approach  to  the  overall  accounting  process, 
and  the  following  page  shows  an  example  of  what  the  summary  might  look  like 
on  the  user's  computer  listing. 
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Anothjir  useful  feature  which  is  related  to  the  accounting  of  data 
points  is  to  save  and  print  out  the  coordinates  of  the  first  and  last  data 
points  plotted,  and  if  desired,  the  first  and  last  points  examined.  This 
gives  the  analyst  the  ability  to  track  down  at  least  two  specific  data 
elements  at  the  extremities  of  the  plot  when  troubleshooting  a malfunction. 
Core  and  processing  requirements  for  this  task  are  trivial. 

Finally,  the  plots  and  grids  themselves  should  be  counted,  and 
summarized  at  the  end  of  the  job'.  The  grids  are  counted  separately  since 
several  plots  may  be  placed  on  a single  grid,  so  the  number  of  grids  is  not 
necessarily  the  same  as  the  number  of  plots.  A typical  summary  might 
state  something  like  "126  PLOTS  ON  84  GRIDS."  This  way,  the  analyst  knows 
at  a glance  whether  the  job  was  completed  in  its  entirety. 
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V.  The  White  Sands  Production  Graphics  System 

White  Sands  Missile  Range  is  one  of  the  world's  busiest  test  ranges, 
where  enormous  quantities  of  data  are  collected  by  both  optical  and  electronic 
means.  A wide  variety  of  data  reduction  programs  reduce  the  raw  data  on  a 
production  basis  in  order  to  provide  the  range  users  with  complete  and 
timely  data  reports  for  use  in  analysis  of  the  individual  test  missions. 

Most  of  these  reports  contain  considerable  graphics,  which  are  generated  by 
a production  graphics  system  which  has  undergone  a number  of  changes  over  the 
years . 

Prior  to  1971,  several  large  flat-bed  plotters  were  in  use,  producing 
a total  of  fewer  than  fifty  plots  per  day.  These  machines  were  operated 
off-line,  and  the  customer  had  to  run  the  computer  job,  retrieve  the  plot 
tape,  personally  deliver  the  tape  to  the  plotting  room  along  with  a work  order 
specifying  grid  size,  scale  factors,  plot  mode  (point  or  line),  ink  color, 
and  so  forth.  The  user  specifications  were  entered  into  the  plotters  through 
switches,  and  the  equipment  was  manually  calibrated  for  each  plot,  based  on 
the  stated  paper  size  and  scaling  factors.  Once  completed,  the  customer 
picked  up  the  plots  and  tape,  disposed  of  the  tape  as  necessary,  then  turned 
in  the  plots  to  a reports  section  to  be  manually  labled  and  annotated.  The 
entire  process  was  not  only  tedious,  but  susceptible  to  human  error  at  every 
stage . 

Due  to  an  increasing  workload,  and  a continuing  obsolescence  of  the 
existing  graphics  equipment,  a changeover  was  made  that  converted  the  graphics 
operation  to  a high-production  system  that  eliminates  all  human  intervention 
that  affects  the  content  of  the  plots.  The  operational  procedure  is  shown 
schematically  in  Figure  3. 

The  core  of  the  system  is  a CalComp  Model  835  CRT  plotter  which  draws  the 
plots  on  a cathode  ray  tube  and  automatically  records  them  on  35  mm  film. 

The  customer  himself  simply  submits  his  job  to  the  computer,  where  an  off-line 
plot  tape  is  generated.  The  computer  listing,  with  full  and  complete  printer 
plots,  is  returned  directly  to  him,  while  the  plot  tape  is  automatically 
delivered  directly  to  the  plotter  without  any  action  on  his  part.  Plot 
tapes  are  grouped  at  the  plotter  (classified  tapes,  of  course,  are  handled 
separately) , and  at  designated  times  of  the  day  all  of  the  tapes  present  are 
run.  The  operator  has  only  to  mount  the  tape,  push  the  start  button,  and 
dismount  the  tape  when  finished. 

Every  plot  is  fully  labeled  and  numerically  annotated  (inclduing  security 
classification  when  needed)  as  it  is  drawn.  The  operating  speed  of  the  plotter 
is  such  that  a single  plot  generally  takes  no  more  than  five  seconds  to 
complete,  even  though  the  grid  itself  must  also  be  drawn.  The  film,  of 
course,  advances  automatically  between  plots.  A tape  containing  thirty 
complete  plots  can  be  fully  processed  by  the  operator  in  three  minutes  or 
so.  The  graphics  software  generates  a header  and  trailer  frame  at  the 
beginning  and  end  of  each  job  for  identification  purposes,  along  with  a 
unique  serial  number  which  is  recorded  in  small  numerals  in  the  far  corner 
of  each  plot. 
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tor  mount  the  reel,  push  the  loose  end  of  the  film  in  a hole,  turn  on  the 
water,  and  watch  it  wind  up  on  a reel  on  the  other  side.  The  processing  time 
for  a 200-foot  reel  (1600  plots)  is  about  15  minutes,  though  preparation  and 
cleanup  require  a few  minutes  additional. 

The  film  is  then  taken  to  a semi-automatic  printer  for  final  printing. 

The  plots  vary  in  size,  according  to  their  intended  use,  and  the  customer 
specifies  the  size  at  the  time  of  the  computer  run.  The  operator  at  the 
printer  does  have  to  change  paper  sizes  when  instructed  to  do  so  by  the 
!]  customer's  header  frame.  Maximum  plot  size  is  11  by  17  inches.  The  printing 

ji  process  requires  about  fifteen  seconds  per  plot. 

ij  Upon  completion  of  the  printing,  the  plots  are  returned  directly  to  the 

I user.  Since  all  labeling  and  annotation  is  done  by  the  plotter,  there  is 

I no  further  clerical  process,  and  the  plots  may  be  inserted  directly  into  the 

I data  reports.  Once  the  finished  plots  have  been  delivered,  the  tapes  are 

f automatically  degaussed,  and  the  film  is  handled  as  classified  waste. 

I Naturally,  there  is  a degree  of  flexibility  built  into  the  operation  to 

;j  handle  special  requirements  such  as  the  occasional  expedite  priority  job. 

And  not  all  graphics  at  White  Sands  are  processed  on  this  system.  There 
I are  a number  of  other  plotters  available,  and  various  organi zatioris  use  them  i 

ifor  special  purposes.  j 

As  carefree  as  the  operation  sounds,  it  has  not  been  without  its  problems.  i 

Perhaps  the  first  to  become  apparent  was  due  to  the  fact  that  the  system  < 

prints  in  only  one  color--black,  on  white  paper.  It  was  very  difficult  to 

convince  the  old-timers,  who  were  accustomed  to  multiple  ink  colors,  that  i 

they  could  live  just  as  well  with  plots  using  one  color  in  multiple  modes 
(line,  point,  symbol,  etc.).  Next  was  the  matter  of  size.  The  system  can 

produce  plots  up  to  11  x 17  inches,  which  is  really  quite  large,  but  the  1 

previous  plotters  could  handle  30  x 30  inch  paper.  No  one  knew  what  he  would 
do  with  a plot  greater  than  11  x 17,  but  many  were  still  hard  to  convince. 

Overall  reliability  did  not  turn  out  to  be  as  good  as  expected,  either. 

This  was  primarily  due  to  the  fact  that  there  are  three  major  links  in  the 
chain:  the  plotter,  the  developer,  and  the  printer.  If  just  one  of  them 

breaks  down,  the  entire  system  grinds  to  a halt.  The  problem  is  controlled 
primarily  by  increased  attention  to  preventive  maintenance.  The  particular 
model  of  plotter,  incidentally,  has  been  discontinued  by  the  manufacturer, 
and  at  some  future  date  this  may  impact  the  maintenance  program,  but  it 
■ has  had  no  effect  as  yet. 


With  the  increased  capability  of  the  system  came  increased  demand. 
Whereas  graphics  requests  had  been  running  at  a few  hundred  per  month,  peak 
output  under  the  present  system  has  soared  to  over  15,000  plots  per  month. 
The  Increased  use  of  graphics,  however,  has  cut  the  need  for  detailed  data 
listings  in  some  areas,  and  often  facilitates  the  analyst's  job.  Some 
samples  of  typical  output  from  the  White  Sands  production  graphics  system 
are  shown  on  the  following  pages. 
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Production  Graphics  System 
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MATHEMATICAL  TRADE-OFFS  FOR 
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John  L.  Lazaruk 

Systems  and  Economic  Analysis  Division 
Office  of  the  Comptroller 
ll.S.  Army  Communications  Cginmand 
Ft.  Huachuca,  Az.  85613 


ABSTRACT.  Recent  investigations  in  system  theory  have  generated  a 
mathematical  definition  of  a trade-off.  This  concept  arises  naturally  in  the 
context  of  economic  decisions,  systems  analysis,  and  systems  engineering. 

This  paper  presents  a mathematical  structure  of  a trade-off  between  system 
or  subsystem  attributes  which  insures  that  the  suboptimization  process  in- 
herent in  the  design  of  large  scale  systems  yields  an  optimization  of  the 
overall  system  design.  In  addition,  a methodology  is  developed  based  on  a 
hierarchy  of  functions  associated  with  tne  life  cycle  of  large  scale  systems 
using  mathematical  trade-offs  for  the  managerial  control  of  the  engineering, 
implementation,  and  operation  of  large  scale  systems. 

1.  INTRODUCTION.  Large  scale  system  exhibit  certain  characteristics 
such  as  an  overall  purpose,  the  interaction  of  humans  and  machines,  and  a 
variety  of  subsystems  which  transcend  the  expertise  of  a single  discipline. 
Generally,  the  user,  operator,  and  builder  are  distinct.  Examples  of  large 
scale  systems  are  military  weapon  systems,  communications  systems,  and 
social  welfare  systems. 

The  characteristics  of  a large  scale  system  lead  to  .a  hierarchy  of 
of  functions  corresponding  to  Figure  1. 


HIERARCHY  OF  SYSTEM  FUNCTIONS 


The  hierarchy  is  based  on  the  interrelationship  of  functional  purposes. 
The  overall  purpose  of  the  system  is  to  satisfy  the  requirements  of  the 
user.  The  user  requirements  are  satisfied  through  the  operation  of  the 
system  by  an  operator.  Thus,  the  system  is  built  for  the  operator  to 
satisfy  the  requirements  of  the  user. 

As  a result  of  this  tri-level  hierarchy  of  system  functions,  corres 
ponding  hierarchy  of  effectiveness  can  be  constructed  as  in  Figure  2. 


The  operational  effectiveness  level  of  a system  pertains  to  rharacteristlcs 
of  conditions  which  are  of  concern  or  are  experienced  by  the  user.  An 
example  of  a measure  of  operational  effectiveness  is  the  length  of  time  the 
system  takes  to  satisfy  a user's  requirement,  speed  of  service.  The  per- 
formance parameters  are  the  characteristics  or  conditions  which  are  of 
concern  or  are  experienced  by  the  operator.  An  example  of  a performance 
parameter  Is  the  percentage  of  the  time  a system  is  capable  of  performing 
its  intended  functions,  the  system  availability.  The  engineering  character- 
istics are  the  last  level  in  the  hierarchy  of  system  effectiveness  discussed  in 
this  paper.  The  engineering  characteristics  are  the  conditions  or  charac- 
teristics the  builder  uses  to  produce  a system,  an  example  is  the  mean  time 
between  failures,  MTBF  of  a system  component. 

These  three  aspects  are  also  visible  in  the  basic  structure  of  a system. 

The  black  box  model  of  a system  exhibited  in  Figure  3 can  be  related  to 
the  tri-level  hierarchial  structure  of  system  effectiveness. 


BLACKBOX  SYSTEM  MODEL 


INPUTS 


TRANSFORMATION 

PROCESS 


> 

OUTPUTS 

> 


FIGURE  3 


The  base  of  the  effectiveness  hierarchy,  engineering  characteristics, 
correspond  to  the  technical  specifications  of  the  transformation  process. 

The  performance  parameters  correspond  to  measurements  of  the  way  the 
transformation  process  performs.  The  operational  effectiveness  of  a system 
corresponds  to  measurements  of  the  way  the  system's  input/outputs  effect 
the  user. 

A major  implication  of  this  structure  in  the  management  of  large  scale 
systems  is  a delegation  of  responsibilities  and  a process  of  suboptimization. 
This  process  is  a result  of  decomposing  a system  into  components  which 
are  then  optimized  or  improved  independently  of  one  another.  What  is 
proposed  in  this  paper  is  a method  of  controlling  the  suboptimization 
process  to  insure  actions  taken  with  respect  to  the  components  are  compatible 
with  the  overall  objectives  of  the  system  and  its  optimization  by  using 
trade-off  functions  in  the  hierarchial  structure  described  above. 
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2.  MEASUREMENTS  AND  TRADE-OFF  FUNCTIONS: 


Coombs,  Raiffa,  and  Thrall  (reference  3)  define  a measurement  as  a 
function  from  a set  objects  to  a mathematical  system.  This  paper  is 
concerned  with  two  types  of  measurement:  measures  of  effectiveness  and 
preference  measures.  If  5 is  a set  of  systems  and  R is  a subset  of  real 
numbers,  a measure  of  effectiveness  is  a function  p;S-^R.  If  P is  a 
partially  ordered  set,  a function  p:S-^P  is  a preference  measure  if 
p(x)^p(y)  for  x,y£S  implies  x is  as  good  as  y. 

In  the  management  of  large  scale  systems,  there  are  generally  many 
attributes,  cha racter i s t i cs , or  conditions  for  which  a manager  is  responsible. 
If  {Pj:S-*-Pj  } represents  managerial  preference  with  respect  to  n different 

attributes,  a preference  measure  T : S^P  which  is  compatible  with  the  managerial 
preference  of  the  individual  attributes  should  have  the  following  property: 


(1)  If  x,yeS  and  Pj (x)^Pj  (y)  for  each  i,  then  T(x)^T(y). 

This  property  is  equivalent  to  one  of  two  properties  used  by  Wymore 
to  define  the  concept  of  a trade-off  between  orderings  in  reference  11. 

The  second  property  in  Wymore's  definition  is  equivalent  to 

(2)  If  x,ycS  such  that  p;(x)_>pj(y)  for  each  i and  there  exists  j 
such  that  Pj(x)^Pj(y),  then  "''(x)^  ’'^(y). 

As  is  often  the  case,  a measure  of  effectiveness  may  also  be  a 
preference  measure  where  the  partial  order  is  the  natural  order  of 
the  real  numbers.  For  a given  attribute,  more  is  generally  better. 

However,  it  is  possible  that  an  increase  in  a particular  attribute 
may  not  cause  one  system  to  be  preferred  to  another  system  with  a 
value  for  the  attribute.  In  fact,  the  US  Army  Logistics  Management 
Center  has  published  an  analysis  v;hich  includes  indifference  curves 
which  bound  regions  in  which  an  attribute  may  be  better  for  one 
alternative  than  another  and  yet  the  decision  maker  is  not  willing 
to  say  the  one  alternative  is  better  than  the  other.  This  means 
that  there  exists  situations  in  which  the  function  representing  the 
preference  for  the  combined  set  of  attributes  does  not  satisfy 
property  (2).  A function,  t:S-'P  will  be  called  a trade-off  function 
over  {p.:S-<-P.  } if  property  (1)  is  satisfied. 

The  next  Theorem  is  fundamental  to  the  methodology  for  the  management 
of  large  scale  systems  described  in  this  paper.  It  is  the  essential 
characteristic  of  trade-off  functions  for  use  with  the  hierarchial  manage- 
ment structure  for  large  scale  systems  which  was  described  in  Section  I . 

Theorem  1 : Let  A be  a set  of  preference  measures  of  a set  of  systems,  S,  and 
{Aj^  } a partition  of  A.  If  is  a trade-off  over  and  t:S-P  is  a 

trade-off  over  {Oj^:S+P|^}  , then  t is  a trade-off  over  A. 

Proof;  If  x,ycS  such  that  p(x)>  o(y)  for  each  pcA,  then  Ok(x)>  0|^(y)  for 
each  k.  Therefore,  t(x)>  T(y).  Thus,  T is  a trade-off  over  A. 


240 


A set  of  systems,  S,  is  complete  with  respect  to  {p.:S-*-Pj  },  a set  of 

preference  measures,  if  for  any  element,  a,  in  the  cartasian  product  of 
p (s)  there  exists  yeS  such  that  Pj  (y)^  Sj  for  each  i. 


Theorem  2:  If  S is  complete  with  respect  to  {p.rS  -*-P.  } and  T:S->-P  is  a 
trade-off  function  which  satisfies  property  (2)  and  each  F.  and  P are 

j 

linearly  ordered,  then  t(x)  is  a maximum  over  t(S)  if,  and  only  if, 

Pj (x)  is  a maximum  over  Pj  (S)  for  each  i. 


Proof:  If  T(x)^T(y)  for  each  yeS,  suppose  there  is  a j such  that 

Pj(y)>  Pj(x).  Since  S is  complete,  there  exists  zeS  such  that  Pj (z)^  Pj (x) 

for  i^j  and  Pj  (z)>;Pj  (y) , Since  t is  a trade-off  which  satisfies  property  (2) 

t(z)>  t(x).  Thus  t(x)  is  not  maximum  over  t(S). 

If  pj (x)  is  a maximum  over  p.  (S)  for  each  i then,  for  ysS  P| (x)^Pj (y)  for 
each  i.  Hence,  t(x)^  T{y)  since  x is  a trade-off. 


3.  MANAGERIAL  CONTROL  THROUGH  TRADE-OFFS 

The  structure  of  the  managerial  problem  in  the  design  of  large  scale 
systems  under  consideration  in  this  paper  has  four  elements: 

(1)  A set  of  characteristics  or  conditions  which  require  separate 
expertise  and  also  define  the  system  management  situation  in  the  sense 
that  preferences  in  the  performance  of  the  system  are  based  on  the 
characteristics  specified.  This  means  that  the  characteristics  and/or 
conditions  specified  are  complete. 

(2)  A method  of  measuring  each  of  the  characteristics  determined  in 
the  first  element  above. 

(3)  Standards  or  tolerances  which  are  used  to  compare  the  characteristics 
measures  in  order  to  insure  the  compatibility  of  the  attribute  values  with 
the  overall  objective  of  the  system. 

(4)  The  means  to  alter  the  characteristic  whose  difference  from  the 
standard  exceeds  the  tolerance. 

The  model  which  forms  the  basis  of  the  managerial  control  theory  presented 
in  this  paper  concentrates  on  the  first  three  elements.  It  is  assumed  that 
the  manager  by  definition  has  the  last  property.  What  we  are  concerned  with 
is  providing  the  responsible  managers  with  the  information  required  to  perform 
the  control  function  rationally  and  effectively  at  each  hierarchial  level. 
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The  first  step  in  the  methodology  of  managerial  control  of  systems  through 
trade-offs  is  to  establish  the  top  level  characteristics  or  conditions  to 
be  controlled.  The  top  level  attributes  in  the  management  of  large  scale 
systems  should  be  relatable  to  the  user  functions  in  the  functional  hierarchy 
described  in  Section  I . The  character i s t ics  or  conditions  for  the  operator's 
managers  to  be  concerned  with  are  traded  off  by  upper  level  attributes. 
Proceeding  in  the  same  manner  through  all  the  levels  of  the  functional 
hierarchy,  the  measures  established  at  each  level  are  trade-offs  of  lower 
level  attributes  and  are  traded  off  by  the  higher  level  attributes. 

Theorem  1 insures  that  the  higher  level  attributes  are  trade-offs  of 
subsets  at  each  lower  level.  Theorem  2 supplies  conditions  under  which  the 
suboptimization  process  necessary  in  large  scale  systems  is  consistent  with 
the  overall  objectives  of  the  system. 


1*.  TRADE-OFFS  IN  A COMMUNICATION  SYSTEM  (An  Example) 


The  purpose  of  a communication  system  is  to  transfer  information  between 
two  or  more  points.  Figure  3 is  a model  of  a simple  communication  system 
providing  circuits  between  two  points. 


Figure  k 


Suppose  the  measurable  purpose  or  overall  objective  of  the  system  is  to 
provide  a required  number  of  circuit  hours  per  month  where  a circuit  hour 
is  defined  as  one  circuit  operating  for  1 hour.  Availability  is  a measure 
of  the  time  a circuit  is  useable.  It  is  expressable  in  terms  of  two 
engineering  criteria  which  measure  the  maintainability  of  the  system. 

Mean  Time  Between  Failure  (MTBF),  and  Mean  Time  to  Repair  (MTTR) . 

The  availability,  Av,  is  defined  to  be  l^TBF 

MTBF  + MTTR 


If  X is  the  failure  rate  in  failures  per  hour  and 

in  repairs  per  hour,  then  P 

Av  - A + p 


is  the  repair  rate 


It  can  be  shown  that  the  availability  is  a trade-off  over  l/X  and  p. 
Assume  each  circuit  in  the  system  has  the  same  maintainability  character- 
istics. The  number  of  circuit  hours  per  day  for  the  given  system  is  a 
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trade-off  between  the  number  of  circuits  and  the  availability  of  each 
circuit  as  represented  by  the  following  equation: 

I CHD  = 2i»NAv 

i 

where  CHD  = Circuit  Hours  Per  Day 
N = Number  of  Circuits 
Av  * Circuit  Availability 

Figure  k summarizes  the  functional  hierarchy  and  the  appropriate  measures. 
FUNCTIONAL  LEVEL  MEASURE 


USER 

CHD 

OPERATOR 

N 

Av 

BUILDER 

N 

I/X 

P 


Figure  5 


The  standards  or  tolerances  on  N,l/X  and  p are  established  or  linked  to 
the  user  requirements  through  the  trade-off  relationship. 


BIBLIOGRAPHY 


1.  Kenneth  J.  Arrow,  Social  Choice  and  Individual  Values,  2nd  Edition, 

John  Wiley  t Sons,  1963. 

2.  L.D.  Attoway,  Criteria  and  the  Measurement  of  Effectiveness  in  Systems 
Analysis  in  Defense,  Ed. by  E.S.  Quade  6 W. i . Boucher,  American  Elsevier 
Publishing  Company,  inc..  New  York,  1968. 

3.  C.H.  Coombs  Howard  Raiffa,  and  R.M.  Thrall,  Mathematical  Models  and 
Measurement  Theory  in  Decision  Processes  Edited  by  R.  M.  Thrall,  C.H. 
Coombs,  and  R.L.  Davis,  John  Wiley  S Sons  195^  PP  19“37. 

k.  Cost  and  Operational  Effectiveness  Handbook,  TRADOC  Pamphlet  11-8. 

5.  Decision  Risk  Analysis  for  the  SUPER  DAWG  Missile  System,  US  Army 
Logistics  Management  Center,  Fort  Lee  Virginia,  ALM-63-3792-H . 

6.  Harry  H.  Goode  £ Robert  E.  Machol , SYSTEM  ENGINEERING  An  Introduction 
to  The  Design  of  Large-scale  Systems,  McGraw  Hill,  Inc.,  1957. 

7.  Richard  Johnson,  Fremont  Kast,  and  James  Rosenzweig,  The  Theory  and 
Management  of  Systems,  McGraw  Hill,  1973. 

8.  John  L.  Kelley,  General  Topology,  D.  Van  Nostrand  Company,  1955. 

9.  K.R.  MacCrimmon,  Decision  Making  among  Mu  1 1 i -At t r i bute  Alternatives: 

A Survey  and  Consolidated  Approach,  Rand  Corporation  Memorandum  RM-4823- 
ARPA  December,  I968. 

10.  Kenneth  0.  May,  Intransitivity  Utility  and  The  Aggregation  of  Preference 
Pattersons,  Econometr i ca , Vol.22,  January  195^.  Number  pp  1-13. 

11.  A Wayne  Wymore,  Systems  Engineering  Methodology  for  Interdisciplinary 
Teams,  John  Wiley  £ Sons,  Inc.,  1976. 

12.  Wayne  Wymore  £ Kingsley  Forry,  "Trades-off  in  Systems  and  Economics 
Analysis",  To  Be  Published. 


244 


RADAR  CROSS-SECTION  DATA  REDUCTION 


Ernest  J.  Sanchez 

National  Range  Operations  Directorate 
U.S.  Army  White  Sands  Missile  Range 
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Abstract 

The  main  theme  of  the  paper  is  the  exact  solution  for  the  normalized 
backscatter  cross-section  of  a perfectly  conducting  sphere  utilizing  Mie 

Theory.  The  Mie  equation  was  used  along  with  spherical  Bessel  and  Hankel 
functions.  Recurrence  Relations  and  Mie  Theory  with  proper  boundary  con- 
• ditions  applied.  The  paper  takes  you  through  step  by  step  procedure  of 
the  exact  solution  plus  the  tie-in  of  these  results  to  the  Radar  Cross- 
Section  Data  Reduction  System. 

The  problem  of  the  scattering  of  electomagneti c waves  from  a sphere 
has  received  considerable  attention  due  to  a large  extent  to  the  fact  that 
the  rigorous  solution  has  been  known  for  a long  time. 

The  rigorous  solution,  known  as  the  Mie  series,  allows  numerical  re- 
sults to  be  obtained  to  a high  degree  of  accuracy. 

The  subroutine  utilized  for  the  Mie  series  calculation  is  ADECRS.  The 
most  critical  part  of  subroutine  is  the  evaluation  of  the  required  Bessel 
and  Hankel  functions.  ADECRS  was  modified  from  a Fortran  II  prooram  written 
by  J.  Rheinstein  at  M.I.T.  for  calculating  the  scattering  of  electromagnetic 
waves  by  a lossless,  layered  spherical  Dielectric  or  Eaton  lens.  ADECRS 
contains  built-in  checking  procedures,  which  are  independent  of  the  calcula- 
tion algorithms  to  insure  that  sufficient  accuracy  in  these  functions  as 
well  as  in  other  parts  of  the  calculation  is  retained  to  give  at  least  five 
significant  figures  in  the  result.  The  results  from  ADECRS  have  been  com- 
pared with  other  published  data  for  accuracy. 

The  exact  solution  for  the  backscatter  cross-section  for  a perfectly 
conducting  sphere  is  computed  on  the  Univac  1108  computer  utilizing  double 
precision  throughout  the  Fortran  V subroutine  ADECRS. 

Following  the  method  of  Mie,  as  outlined  in  Stratton,  the  backscatter 
cross-section  of  a perfectly  conducting  sphere  is  represented  by 


o 


s 


? (-1)"  (2n  + 1)  (a^ 


The  paper  contains  an  introduction,  a mathematical  analysis  and  deriva- 
tion of  Radar  Cross-Section  equations;  a bibliography,  plus  17  illustrations 
and  a glossary  section. 
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A.  Proeram  Objective 

The  nCSl65  fAodule  is  a collection  of  subroutines  with  a main  program 
monitor  developed  for  the  A-Scope,  ACC,  and  AGC-VCO  radar  cross- 
section  (RCS)  data  reduction.  RCS165's  main  function  is  to  convert 
the  digitized  raw  data  to  useful  radar  cross-section  data  on  a daily 
production  basis. 

A new  version  of  the  Radar  Cross-Section  Data  Reduction  System  was 
deemed  necessary  due  primarily  to  the  many  additions  and  changes 
that  have  been  made  to  the  subroutines  for  greater  clarity,  complete- 
ness, efficiency  and  new  program  requirements. 

The  author  would  like  to  express  his  thanks  to  Liz  Duran  oior  secretary, 
for  her  tireless  effort  in  typing  this  new  document  version. 

B.  Changes  from  the  Previous  Version 

This  is  the  first  version  of  the  program  under  )/IPS. 

C.  Operating  Environment 

RCS165  utilizes  Fortran  V,  and  operates  on  the  Univac  llOS  with  the 
EXEC  8 operatir."  system  as  a Stand  Alone  Program.  This  module  is  of 
an  auxiliary  nature. 

Core  storage  area  is  conserved  by  the  utilization  of  overlaying  tech- 
nique (segmentation)  in  the  subsystem. 

A total  of  ten  major  output  formats  are  generated  by  this  system  to 
cover  all  present  V.’SK®  requirements.  Tape  Record  format  and  listing 
information  was  typed  so  as  to  allow  for  zeroxing  of  a specified 
format  as  requested  by  user  without  having  to  include  other  formats. 


246 


. MATimiATICAI.  ANALYr.IS  f>m  I.XIC 
A.  MATHEMATICAL  ANALYSIS 
1,  GLOSSARY 

The  following  glossary  is  provided  so  that  readers  may  find  most 
of  the  definitions  and  description  symbols  utilized  in  this  section  in  one 
place.  Many  of  the  terms  not  defined  here  will  be  defined  in  the  text  in  ] 

order  to  provide  greater  clarity  and  continuity.  A term  will  normally  be  ' 
underlined  to  denote  a definition.  I 

Sections  2 through  22  contain  the  derivation  of  the  major  equations  | 

utilized  in  the  radar  cross-section  data  reduction  subsystem  module,  RCS165.  ! 


( 

i 
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1.  GLOSSARY  (COf.’TimiED) 

A-SCOPE 

Type  A or  A-Scope  is  a type  of  data  presentation  where  the  echo  strength 
or  target  amplitude  is  indicated  by  vertical  displacement  (Y-axis)  of  the 
luminous  spot  and  range  by  horizontal  (X-axis)  displacement  on  a cathode- 
ray  tube.  There  is  no  angle  information  displayed  on  this  type  of  data 
presentation. 

Optical  cameras  are  setup  and  calibration  procedures  are  follov/ed  for 
A-Scope  recording  on  film.  The  data  is  recorded  on  35  im  film  in  three 
ways;  Type  I film  is  video  pulse  streak  camera  recording  (strip  film). 

Type  II  film  is  framing  camera  recording,  and  Type  III  film  is  intensity 
modulation  streak  camera  recording.  The  timing  on  film  is  IRIG  timing. 

This  technical  report  covers  the  data  reduction  of  strip  and  frame  film 
data.  A-Scope  type  data  is  primarily  utilized  when  target  discrimination, 
individual  RCS  for  entire  pulse,  peak  RCS,  or  other  signature  analysis 
is  required. 

Another  type  of  presentation  is  Type  P or  the  plan  position  indicator 
(PPI)  scope  where  a radial  displacement  of  the  spot  from  the  center  of  the 
screen  indicates  range  and  the  direction  of  the  radial  displacement  indicates 
azimuth  angle. 

A third  type  is  Type  B in  which  vertical  displacement  indicates  range 
and  horizontal  displacement  indicates  azimuth  angle.  There  are  many  other 
display  systems  in  current  use,  but  only  the  A-Scope  type  is  currently 
being  reduced  at  WSMR.  (See  figure  2.1) 


AUTOMATIC  GAIN  CONTROL  (AGC) 

The  function  of  the  AGC  is  to  maintain  the  d-c  level  of  the  receiver 
output  constant  and  to  smooth  or  eliminate  as  much  of  the  noiselike  amplitude 
fluctuations  as  possible  without  disturbing  the  extraction  of  the  desired 
error  signal  at  the  conical-scan  frequency.  One  other  purpose  of  the  AGC 
in  any  receiver  is  to  prevent  saturation  by  large  signals. 

I 

The  digitized  AGC  data  recorded  on  radar  field  tape  is  pre-calibrated 
I on  the  boresight  tov/er  in  5 db  steps  from  0 db-noise  level.  This  data 

is  then  stored  in  the  Milgo  ARCADE  Computer  memory  and  during  mission  the 
target  video  is  referenced  to  stored  AGC  precal.  ARCADE  is  an  abbreviation 
for  Automatic  Radar  Control  and  Data  Equipment.  The  reference  AGC  level 

Lis  then  recorded  on  radar  field  tape.  The  digitized  raw  data  is  produced 

as  a positive  displacement  above  the  noise  level.  Each  radar  launches  their 
own  spheres  and  conducts  the  sphere  calibrations  immediately  after  mission 
completion.  This  type  of  data  is  normally  converted  to  peak  RCS. 


1.  GLOSSARY  (CONTINUED) 


Signal 

amplitude 


sweep  calibrated  range 


A-SCOPE  DISPLAY  SYSTEM 
(Figure  2,1) 


1.  GLOSSARY  (CONTINUED) 

AUTOMATIC  GAIN  CONTROL -VOLTAGE  CONTROL  OSCILLATOR  (AGC-VCO) 

AGC-VCO  describes  a type  of  raw  data  available  from  the  FPS16's.  The 
detected  video  is  recorded  on  analog  tape  by  a telemetry  VTm-ilation  utilizing 
FM  methods.  The  data  is  later  digitized  to  machine  counts  and  from  there  Ic 
engineering  units  of  db  through  the  use  of  pre  or  post  step  calibrations.  This 
type  of  data  is  normally  converted  to  peak  RCS. 


BEAMWIDTH  (Gj^) 

The  symbol  theta  sub  b denotes  beamwidth.  The  width  of  the  beam  is 
usually  specified  by  the  beam  angle  between  half-power  points.  This  is  the 
angle  between  lines  on  opposite  sides  of  the  beam  axis  along  which  the  oower 
density  is  half  as  great  as  it  is  on  the  axis.  The  beam  angle  serves  as  a 
measure  of  the  angular  accuracy  and  angular  resolution  of  a radar  set. 


DECIBEL  (db) 


A unit  for  expressing  the  magnitude  of  change  in  electrical  power  level. 
The  bel  is  the  fundamental  division  of  a logarithmic  scale  expressing  the 
ratio  of  two  amounts  of  power,  the  number  of  bels  denoting  such  a ratio  being 
the  logarithm  to  the  base  10  of  this  ratio.  The  decibel  is  one-tenth  of  a 
bel.  For  example,  with  Pj  and  P2  designating  two  amounts  of  power  in  watts 
and  n the  number  of  decibels  denoting  their  ratio, 


n(db)  = 10  log 


Pi 

10  “Pi' 


DECIBEL  REFERENCED  TO  ON'E  SQUARE  METER  (dbsm) 

If  X is  in  square  meters  and  is  equal  to  1 square  meter  then 
X{dbsm)  = 10  logio(X/X^g^). 

In'  this  technical  report,  radar  cross-section  is  computed  in  square  meters 
first,  and  then  the  above  equation  is  utilized  to  compute  dbsm. 


I 
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1 . GLOSSARY  (CONTINUED) 

ELECTROMAGNETIC  WAVE 


An  electromagnetic  wave  consists  of  coupled  electric  (E)  and  magnetic 
(B)  field  oscillations. 


FOOT 

The  definition  of  a foot  is  the  one  used  by  the  United  States  Coast  and 
Geodetic,  rather  than  the  international  definition  of  one  inch  equals  2.54 
centimeters.  This  was  selected  since  all  the  site  surveys  are  based  on  this 
definition.  Exactly  3937  feet  are  equal  to  1200  meters  exactly.  These  values 
were  obtained  from  IRIG  Document  104-64,  "A  Glossary  of  Range  Terminology." 


FREQUENCY  (f) 

The  number  of  waves  (cycles)  that  go  by  a point  in  a unit  of  time 
(seconds).  It  is  normally  seen  as  megacycles  (Me)  for  radar  frequency;  such 
as,  5490  Me  per  second  for  an  FPS16. 


PEAK  TRANSMITTED  POWER  (P  ) 

S-  o' 

The  parameter  P is  actually  the  rms  power  during  the  pulse.  It  is 
usually  expressed  in  megawatts  or  10-  milliwatts. 


1 

POLARIZATION 

For  a plane  electromagnetic  wave,  the  electric  field  vector  must  always 
point  in  a direction  perpendicular  to  the  direction  of  propagation.  If  this 
direction  is  constant,  the  electric  field  lies  entirely  in  one  plane  and 
the  electromagnetic  wave  is  said  to  be  linearly  polarized.  If  the  direction 
rotates  with  time  at  a constant  rate  the  wave  is  said  to  be  elliptically 
polarized.  A circular  polarized  wave  is  a special  case  of  elliptical 
polarization.  (See  vertical  and  circular  polarization). 
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1 . glossary  (CONTI NUL'D) 


POLARIZATION  (CIRCULAR) 

If  the  electric  field  vector  appears  to  be  rotating  clockwise  to  an 
observer  looking  in  the  direction  of  propagation,  the  polarization  is  said 
to  be  right  circular  or  right  elliptical.  Counterclockv/i se  rotation  looking 
in  the  direction  of  propagation  is,  of  course,  designated  left  circular  or 
left  elliptical.  This  is  the  IEEE  standard  definition.  The  reader  should  be 
warned  that  many  authors  use  the  opposite  definition.  The  FPSlS's  did  have 
the  capability  for  circular  polarization  at  one  time,  but  this  has  been 
removed. 


POLARIZATION  (VERTICAL) 

Linear  polarization  can  be  divided  into  two  orthogonal  polarizations 
which  are  called  horizontal  and  vertical  polarization  with  the  planes  containing 
the  electric  field  lying  parallel  and  perpendicular  to  the  earth's  surface. 

The  FPSlG's  transmit  and  receive  a perpendicular  (vertical)  field.  Associated 
with  each  electric  field  there  is  a magnetic  field  whose  direction  is 
perpendicular  to  both  the  electric  field  and  the  direction  of  propagation, 
but  polarization  is  defined  in  terms  of  the  electric  field  vector. 


POWER  RETURN  OR  POWER  RECEIVED  (P^) 

The  radar  receiver  return  power  is  not  measured  directly  but  rather  is 
recorded  as  a power  level  (S)  so  many  db  above  (or  below)  the  receiver 
noise  power  (N).  The  receiver  noise  power  is  considered  to  be  constant  for 
all  practical  purposes  but  may  contain  a slight  drift.  The  raw  data  from 
A-Scope,  AGO,  AGC-VCO,  and  the  sanborns  (strip  charts)  is  therefore 
a record  of  the  ratio  (S/N)  expressed  in  decibels  (db).  All  four  types 
of  raw  data  can  be  expressed  as  a positive  displacement  above  noise  or  in 
direct  proportion  to  the  step  calibrations. 
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The  poynting  vector  describes  the  flow  of  energy  in  an  electroniagneti c 
wave.  Its  direction  is  that  of  tlie  wave  and  its  magnitude  is  equal  to  the 
rate  at  which  energy  is  being  transported  by  the  wave  per  unit  cross-sectional 
area  (watts/in^). 


The  variations  in  an  electromagnetic  wave  occur  simultaneously  in  both 
fields  so  that  maxima  and  minima  occur  at  the  same  times  and  places.  The 
direction  of  the  electric  and  magnetic  fields  are  perpendicular  to  each  other 
and  to  the  direction  the  wave  is  moving.  Waves  are  therefore  transverse. 

The  speed  of  the  waves  depends  only  upon  the  electric  and  magnetic  properties 
of  the  medium  they  travel  in  and  not  upon  the  amplification  of  the  field 
variations . 


PROPAGATION  VELOCITY 

The  propagation  velocity  is  the  velocity  of  light  in  a vacuum.  The 
value  obtained  from  IRIG  Document  104-64  is  299.7925  meters  per  microsecond. 


PULSE  DURATION  (t) 

The  symbol  tau  denotes  pulse  duration  which  is  the  length  of  time  a 
transmitter  emits  energy.  Pulse  duration  determines  range  resolution.  A 
pulse  duration  of  one  microsecond  would  result  in  a range  resolution  cell 
of  approximately  492  feet  or  1 microsecond  = 149.89625  meters. 


253 


1.  GLOSSARY  (CONTIKUCD) 

PULSE  REPETITJOf.'  FREQUCNCY  (PRF) 

PRF  is  the  rate  at  which  pulses  are  transmitted  for  a periodic  pulse 
train;  for  example,  ICO  PRF  or  160  pulses  every  10'  _ seconds.  One  of 
the  uses  of  a radar's  PRF  is  to  detennine  maximum  range  from  which  echoes 
can  be  returned  without  ambiguity.  The  propagation  velocity  (c)  is  divided 
by  2PRF.  An  example  would  be  R(max)  = 299.7925  m per  u sec  divided  by 
2(160  pulse  per  10^  u sec)  which  is  approximately  equal  to  936,852  meters 
or  about  580  statute  miles. 


RADAR  CROSS-SECTIOR  (a)  or  (RCS) 


The  symbol  sigma  or  RCS  denotes  radar  cross-section  or  alternatively 
back-scattering  cross-section.  It's  defined  as  the  area  intercepting  that 
amount  of  power  which,  when  scattered  isotropically,  produces  an  echo  equal 
to  that  observed  from  the  target.  An  alternate  definition  is  the  cross- 
sectional  area  of  a perfectly  conducting  sphere  that  would  return  the  same 
power  to  the  radar  as  does  the  actual  target.  RCS  is  normally  expressed  in 
or  dbsm.  A third  definition  is  the  area  which  would  intercept  sufficient 
power  out  of  the  transmitted  field  to  produce  the  given  echo  by  isotropic  re- 
radiation. All  three  definitions  are  essentially  equivalent  to  each  other. 

The  FPS16's  are  monostatic  or  back-scatter  radars.  This  means  that  the 
receiving  and  transmitting  antennas  are  one  and  the  same.  Back-scatter 
cross-section  of  a sphere  (o  ) is  the  value  in  or  dbsm  computed  in  sub- 
routine ADECRS  by  the  utilization  of  Hie  theory  for  exact  solution  of  a 
perfectly  conducting  sphere. 


Range  is  defined  as  the  propagation  velocity  multiplied  by  delta  time 
divided  by  2.  Delta  time  is  the  time  required  electromagnetic  energy  wave 
to  travel  out  to  object  being  tracked  and  back. 
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GLOSSARY  (CONTIfUmU) 
SIGNAL  STRENGTH  (S) 


A measure  of  the  power  output  of  a radar  at  a particular  location, 
nonnally  the  radar  receiver.  To  measure  signal  strength  (S)  at  radar 
receiver,  a sphere  calibration  should  be  utilized. 


SIGNAL-TO-NOISE  RATIO  (S/N) 

In  radar,  the  ratio  of  the  value  of  the  signal  to  that  of  the  noise. 

The  ratio  is  expressed  in  decibels  and  is  expressed  as  the  ratio  of  root- 
mean-square  signal  voltage  during  the  pulse  to  root-mean-square  noise  voltage, 
as  measured  at  any  point  in  the  i-f  portion  of  the  receiver  following  the 
introduction  of  substantial  gain  and  restriction  of  the  noise  bandwidth. 


WAVELENGTH  (x) 


The  symbol  lambda  denotes  the  wavelength  of  the  energy  transmitted 
by  radar  in  meters  per  cycle.  It  is  calculated  by  dividing  the  propagation 
velocity  (c)  by  the  frequency  of  radar. 


WHITE  SANDS  CARTESIAN  SYSTEM  (WSCS)  LEFT  HAND  SYSTEM 
— 

A Cartesian  coordinate  system  with  origin  at  the  intersection  of  latitud 
((j)o)  330  OS'O.OOO"  North  and  longitude  (Xq)  106°  20'0.000"  West.  At  this  ori 
the  XY  plane  is  tangent  to  the  Clark  Spheroid  of  1865  with  the  semi-major  axi 
(a)  equal  to  6,37E,206.4  meters  and  the  semi-minor  axis  (b)  equal  to 
6,356,583.8  meters.  The  eccentricity  squared  major  (e^)  = l-b^/a^  is  equal 
to  0.00676  86579  97291.  The  eccentricity  squared  minor  ((eM^)  is  equal  to 
0.00681  47849  45915.  The  origin  has  a value  of  500,000.00  feet  East  (Eq), 
500,000.00  feet  North  (N^),  and  0.00  feet  for  (Z  ).  The  Y-axis  is  an 
east-west  line  in  the  tangent  plane  passing  through  the  origin  and  increasing 
positively  eastward.  The  X-axis  is  a north-south  line  in  the  tangent  plane 
passing  through  the  origin  and  increasing  positively  northward.  The  Z-axis 
is  perpendicular  to  the  tangent  plane  and  increasing  positively  uoward.  There 
are  numerous  systems  available  for  WSMR-provided  data  which  are  fully  documented 
in  Data  Reduction  Division  Technical  Report  titled,  "Coordinate  Systems  Related 
; to  WSMR,"  July  1964  where  above  values  were  obtained. 
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2 . RAnAR  RAUGC  EQUATION 


In  this  derivation,  no  propagation  effects  are  considered.  The  radar  se 
and  the  missile  are  assumed  to  be  isolated  bodies  in  space.  The  antenna  is 
positioned  so  that  maximum  radiation  is  directed  toward  the  object. 


The  term  "target"  will  refer  to  the  object  whose  radar  cross  section  is 
desired. 

Let  R = the  slant  range  from  the  radar  to  the  target 

Pp  = the  peak  pov/er  of  the  rectangular  pulse  transmitted  from  the  antenna. 

This  peak  power,  P , is  spread  over  the  spherical  wavefront  traveling  cut- 
ward  from  the  antenna.°  When  the  wavefront  reaches  the  target,  the  area  of  its 
spherical  surface  is  4tiR2,  and  if  the  radiation  were  uniform  in  all  directions 
the  power  density  at  the  target  would  be  Pq/4ttR2.  Because  the  antenna  is 
directional,  the  distribution  of  power  is  not  uniform.  The  power  density  is 
greater  than  Pq/4tiR2  at  the  target  and  less  at  other  positions  on  the  wave- 
front.  Thus  the  poiver  density  at  the  target  is 


(2.1)  (Po/4t,R2)G^ 

where  G.  is  a factor  greater  than  1 to  take  account  of  the  concentration  of 
energy  in  the  direction  of  the  missile.  A fictitious  antenna  that  radiates 
unifonnly  in  all  directions  is  called  an  isotropic  antenna,  and  G^  is  called 
the  power  gain  of  the  transmitting  antenna  relative  to  an  isotropic  antenna. 
The  gain  G^  is  the  ratio  of  the  actual  power  density  at  the  target  to  the 
power  density  that  would  be  produced  by  an  isotropic  antenna.  Part  of  the 
transmitted  wave  is  reflected  from  the  target.  Wavefronts  of  the  reflected 
wave  are  expanding  spheres  centered  at  tne  target.  Suppose,  temporarily, 
the  target  is  a sphere  having  a perfectly  conducting  surface  and  cross 
sectional  area.  Such  a sphere  collects  the  power  in  an  area  a of  the  incident 
wave  and  reradiates  this  power  uniformly  in  all  directions,  provided  the  dia- 
meter of  the  sphere  is  large  relative  to  the  length  of  the  radio  waves.  The 
reflected  wavefront  has  an  area  of  4iiR2  when  it  reaches  the  radar  antenna,  and 
therefore  the  power  density  of  the  reflected  wave  at  the  radar  antenna  is 

(2.2)  (P£,/4itR2)(G^)(o/4i<R2) 

Actual  targets  may  extract  more  or  less  energy  from  the  transmitted  wave  than 
that  contained  in  their  cross  sectional  area.  Furthermore,  they  absorb  some 
of  this  energy  and  reradiate  only  the  remaining  part,  and  the  radiation  is 
usually  far  from  uniform  in  direction.  Nevertheless,  the  reflecting  ability 


I 


! 

i 


J 
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of  any  target  can  be  described  by  an  area  o,  called  the  radar  cross  section 
of  the  object,  which  is  the  cross  sectional  area  of  a pericTTly  co‘nducVing~ 
sphere  that  would  return  the  same  power  to  the  radar  as  does  the  actual  target. 
Equation  2.2  thus  applies  to  all  targets  if  o is  considered  to  stand  for 
radar  cross  section. 

The  power  absorbed  from  the  reflected  wave  by  the  radar  antenna  is  propor- 
tional to  the  power  density  of  the  wave  when  it  reaches  the  antenna.  Therefore, 
the  received  power,  P , is; 


(2.3)  = (Pq/4.R-^)(G^)(o/4t,R2)A 

(The  Radar  Equation) 

= (PoGj.oA)  / (ArjSR'^ 


where  A is  the  constant  of  proportionality.  This  constant  has  units  of  area 
and  is  called  the  effective  area  of  the  receiving  antenna.  Equation  2.3  is 
knovm  as  the  radar  range  equation. i 

In  the  FPSlG's  the  antenna  disk  is  a circular  paraboloid  which  is  the 
surface  generated  by  rotating  a parabolic  curve  about  its  axis.  The  prop- 
erties of  the  parabola  which  makes  it  particularly  useful  for  focusing 
radiant  energy  into  a directional  beam  are  characterized  by  two  ray  con- 
siderations: First,  any  ray  from  the  focus  is  reflected  in  a direction 
parallel  to  the  axis  of  the  parabola;  and  second,  the  distance  traveled  by 
any  ray  from  the  focus  to  the  parabola  and  by  reflection  to  a plane  per- 
pendicular to  the  parabola  axis  is  independent  of  its  path,  and  therefore 
such  a plane  represents  a wave  front  of  uniform  phase. 

Calculations  of  the  antenna  efficiency  based  on  the  aperture  distri- 
bution set  up  by  the  primary  pattern  as  well  as  the  spillover  indicate 
theoretical  efficiencies  of  about  80  per  cent  for  paraboloidal  antennas 
when  compared  across  the  aperture,  poor  polarization  characteristics,  and 
antenna  mismatch  reduce  the  efficiency  to  the  order  of  55  to  65  per  cent 
for  ordinary  parabolic  reflector.  It  should  be  noted  that  the  antenna 
aperture  of  a parabolic  reflector  is  the  area  projected  on  a plane 
perpendicular  to  its  axis  and  is  not  the  surface  area.*' 

The  aperture  efficiency  factor,  Neta  sub  a (n,)  is  included  for  variations 
in  illumination  over  the  aperture.  A circular  aperture  with  edge_^i  1 lumi nation 
reduced  to  suppress  rate  lobes  will  have  n equal  to  60  per  cent.^^ 
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2.  RADAR  RANGE  EQUATION  (CONTINUCD) 

The  theoretical  gain  for  circular  aperture  may  then  be  represented  by 

AnAn,  4Ti(itr^)n 

(2.4)  ^ = 5. 

a2 


r 

if  — >>  1 where  r is  the  radius  of  the  reflector  and  lambda  (the  wave- 
length) is  the  propagation  velocity  divided  by  the  frequency  of  radar. 25 
For  the  FPS16,rA  is  approximately  equal  to  32. 

The  half  angle  subtended  by  paraboloid  at  focus  is  about  35  degrees 
for  a gain  efficiency  of  aperture  of  about  60  per  cent. 

The  beamwidth  in  degrees  may  be  approximated  utilizing  the  following 
equation  for  circular  aperture. 


(2.5) 


„ y 1.2X  /360  s 
‘b  ‘ ’ 


D 


2ir 


where  D is  the  diameter  of  reflector. 

Thuf  equati on  2. 3 may  be  rewritten  in  the  form: 


(2.6) 


PoG^G^oX2 


(47i)3R‘^ 


If  the  same  antenna  is  used  for  transmitting  and  receiving  we  have  = G^  = G 
and  we  may  write  2.3  as 


(2.7) 


P G2x2o 
0 


(4Tr)3R‘* 


This  latter  form  will  be  used  in  the  rest  of  the  section  to  derive  the 
necessary  formula.. 
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3 . BACKSCATTER  CROSS-SECTIO;;  FOf<  A PCRFLCTLY  CONDUCTING  SPIILRi:  (j.) 


The  problem  of  the  scattering  of  electromagnetic  waves  from  a sphere 
has  received  considerable  attention  due  to  a large  extent  to  the  fact  that 
the  rigorous  solution  has  been  known  for  a long  time. 

The  rigorous  solution,  known  as  the  Mie  series,  allows  numerical  results 
to  be  obtained  to  a high  degree  of  accuracy. 

The  subroutine  utilized  for  the  Mie  series  calculation  is  RCSCRS.  The 
most  critical  part  of  subroutine  is  the  evaluation  of  the  reouired  Bessel 
and  Hankel  functions.  PS^C'.-.Z  was  modified  from  a Fortran  11  program  written 
by  J.  Rheinstein  at  H.l.T.  for  calculating  the  scattering  of  electromagnetic 
waves  by  a lossless,  layered  spherical  Dielectric  or  Eaton  lens,  see 
references.  RCSCRS  contains  built  in  checking  procedures,  which  are  in- 
dependent of  the  calculation  algorithms  to  insure  that  sufficient  accuracy  in 
these  functions  as  well  as  in  other  parts  of  the  calculation  is  retained  to 
give  at  least  five  significant  figures  in  the  result.  The  results  from 
RCSCRS  have  been  compared  with  other  published  data  for  accuracy. 

The  exact  solution  for  the  backscatter  cross-section  for  a perfectly 
conducting  sphere  is  computed  on  the  Univac  1108  computer  utilizing  double 
precision  throughout  the  Fortran  V subrouti ne  RCSCRS . 

Following  the  method  of  Mie,  as  outlined  in  Stratton, the  backscatter 
cross-section  of  a perfectly  conducting  sphere  is  represented  by 


(2.8) 


“ (-1)"  (2n  + 1)  (a^^  - b^)  2 

n = 1 " " 


Since  the  values  of  the  real  and  imaginary  coefficients  Re  Ini 

Re  Im  {b^},  decrease  rapidly  for  n > p,  it  is  necessary  to  carry  summation 

only  to  n = 2p  + 5 or  a maximum  value  for  n of  about  245  for  r/\  = 19.00.  The 
following  notation  is  utilized. 


(2.9) 


p = 


llL 

X 


where  r is  the  sphere  radius,  and  X = velocity  of  light/frequency. 


i 


k 


m 


1 


3. 


BACKSCATTER  CROSS-SECTION  FOR  A PCRFECTLY  CONDUCTir.'G  SPHERE  ( c ,)  (COMIIulP) 


i 


(2.10)  (p) 

, , f2) 

where  (p)  and  h^'  are  spherical  Bessel  and  Hankel  functions  and 
primes  denote  di ffernti ati on  with  respect  to  argument. 


(2.11)  = -[p  (p)]'  / [p  h^  (p)] 


The  above  equation  was  utilized  along  with  spherical  Bessel  and  Hankel 
functions  which  are  evaluated  right  in  the  program  RCSCRS  by  the  use  of 
recurrence  relations. 

Most  of  the  problems  and  difficulties  that  are  encountered  are  related 
to  the  generation  of  the  necessary  spherical  Bessel  and  Neumann  functions 
and  to  the  evaluation  of  the  determinants. 

The  spherical  Bessel  and  Neumann  functions  are  calculated  by  the  sub- 
stitution of  Ap(p)  and  A_^(p)  for  jp(p)  and  n^(p),  respectively.  These 

functions  are  related  by 


(2.11-1)  J^(p)  = p"  A^(p)  ^'VlTI 


and 

(2.11-2) 


n^(p)  = p 


-(n+1)^.  (p)  (-1)?(2n)! 


-n 


n! 


The  function  A ^(p)  can  be  evaluated  very  simply  since 


(2.11-3) 


A_o(p)  = COS  X 


J 
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3.  BACKSCATTER  CROSS-SECTION  FOR  A PERFECTLY  CONnUCTlNG  SPHERE  {c. 


(2.n-4) 


A_l(p)  = psinp+  cos  p 


and  the  recursion  relation 


(2.11-5) 


-n  -1 


(p)  = A „(p)  - 


-n' 


4(-n  - 0.5)(0.5  - n) 


can  be  used  for  calculating  higher  orders. 

The  function  A^(p)  is  more  difficult  to  evaluate. 

Assume  that  the  argument  is  p and  the  largest  desired  order  is  n 
k equal  to  approximately  the  maximum  of  m + 20 , p + 30  or  35.  Then 


(2.11-6) 


Hj,(p)  = 0.0 


(2.11-7) 


H^_.,(p)  = 10-6 


then,  employing  the  recursion  relation. 


(2.11-8) 


^.i  4 1 (P) 


4(j  + 0.5)(j  + 1.5) 


determine  all  H.(p),  j = k - 2,  k - 3,  . . . , 1 

V 


)(C0NTlNl)rD) 


I.  Choose 
set 
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3.  BACKSCATTER  CROSS-SECTION  FOR  A PERFLCTLY  CONDUCTING  SPHERE  (o^) (CONTINUED) 
Evaluate 


(2.11-9) 


A (p)  = - cos  p)  ^ 

^ ft  n2 


and  determine  the  ratio 

(2.11-10)  Y = A (p)  / H (p) 

1 1 


It  will  then  be  found  that 


(2.11-11)  A^(p)  = YH^(p) 


where  n = 2,3,4,  . . . , m 

and  that  the  required  functions  are  evaluated. 

The  functions  obtained  in  this  manner  may  be  checked  by  employing  the  relation 


(2.11-12) 


A„(p)  A 


-n  - 1 


(p)  - 


4(n  + 0.5)(n  + 1.5) 


The  determinants,  as  employed,  have  a form  which  may  be  evaluated  in  a 
relatively  simple  manner.  If  c(i,j)  denotes  the  element  in  row  i and  column 
j,  the  nonzero  terms  are  c(i,i-l),  c(i,i),  c(i,i+l),  c(i,i+2),  for  i odd,  and 
c(i,i-2),  c(i,i-l),  c(i,i),  c(i,i+l),  for  i even.  However,  c(l,0),  c(2,0), 
c(2N-l  ,2N+1 ) , and  c(2N,2N+l)  do  not  exist. 

After  some  manipulation  of  the  determinants,  the  Mie  coefficients  may  be 
put  into  a form  such  that 


(2.11-13) 


2 


1 


i 


i 

1 


i 

i 


i 


262 


The  determinants  N„  and  D are  identical  except  for  the  terms  c{l,l)  and 

c(2,l).  Similar  forms  are  obtained  for  the  coefficients  b by  simply  multi- 
plying each  element  of  rows  2i  + 1 , i = 0 ,1  ,2 , . . . by  the  factor  c(i  + 1). 
where  e = 1 for  a dielectric  in  free  space.  All  c(ij)  are  real. 

If  the  above  notation  is  employed,  the  determinant  may  be  evaluated  by 
the  following  scheme. 

Evaluate 

(2.11-15)  B(2.2)  = c(l,l)  c(2,3)  - c(1.3)  c(2,l) 


(2.11-16)  B(2,l)  = c(l.l)  c(2,4)  - c(l,4)  c(2,l) 


I = 3 


(2.11-17)  B(l,l)  = c(I.I-l)  B(2,2)  - C(I.I-2)  B(2,l) 


(2.11-18)  B(l,2)  = c(I  +1,1-1)  B(2,2)  - c(I  + 1,  1-2)  B(2,l) 


(2.11-19)  B(2,2)  = c(I  + 1,  I)  B(l,l)  - c(I,I)  B(l,2) 


(2.11-20)  B(2,l)  = c(I  + 1,  I + 1)  B{1,1)  - c(I,  I + 1)  B(l,2) 


Then  set  1=1+2  and  recompute  B(l,l),  B(l,2),  etc.  The  value  of  the  ! 

determinant  is  the  value  of  B(2,2)  when  I has  reached  the  value  2N  - 1 . 

This  scheme  for  evaluating  the  determinants  was  found  by  employing  Laplace's 
expansion  by  minors.  As  a check,  the  determinants  are  evaluated  a second  time 
by  employing  a similar  scheme  found  by  a different  form  of  Laplace's  expansion. 

A simplified  flow  chart  follows  to  aid  readers  through  subroutine 
RCSCRS.  (see  figure  2.2-1) 
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BACKSCATTCR  CROSS-Sc  CTIC^  FOR  A PcPFrcTLY  CO'iDCCTinG  SPHERE  (a  ) (COUTHHJED) 


Read  in  data 


Calculate  arauments  for  functions 


Calculate  A and  A 

n -n 


Test  A and  A 

n -n 


error 


pn  ntcut 


Evaluate  elements  of  determinant  for  a 


Evaluate  determinant 


erent  c(l  ,1)  and  c{2,l) 


Determine  Mie  coefficient! 


Comoute  and  write  backscatter  cross-section 


(Figure  2.2-1) 
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The  data  in  table  and  plots  has  been  normalized  so  as  to  be  dimensionless. 
The  following  table  is  included  to  orovide  the  position  of  the  minima  and 
maxima  of  the  normalized  backscatter  from  a conducting  sphere. 


POSITION  OF  MAXIMA 

CROSS-SECTION 

POSITION  OF  MINIMA 

CROSS-SECTION 

r/X 

o/itr^ 

r/X 

o/ 

0.1636 

3.65495 

0.2775 

0.285041 

0.3717 

1.96958 

0.4707 

0.505600 

0.5659 

1.58864 

0.6626 

0.634592 

0.7578 

1.41048 

0.8543 

0.716232 

0.9495 

1.30698 

1 .0458 

0.772456 

1.1413 

1.24023 

1.2374 

0.813423 

1.3331 

1.19398 

1.4292 

0.844286 

1.5249 

1.16016 

1.6210 

0.868137 

1.7169 

1.13451 

1.8130 

0.887011 

1.9089 

1.11453 

2.0051 

0.902239 

2.1011 

1.09864 

2.1973 

0.914714 

2.2933 

1.08576 

2.3896 

0.925057 

2.4857 

1.07515 

(Figure 

2.3) 

The  following  plots  ivere  generated  from  RCSCRS  data  and  were  plotted  to 
four  significant  figures  utilizing  the  EAl  3440  digital  plotter. 
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4.  SIGNAL  STRENGTH 

Signal  strength  (S)  is  the  measurement  of  the  power  output  of  a radar  at 
a particular  location.  To  generate  (S),  a sphere  calibration  should  be  employed. 
The  signal  strength  in  milliwatts  may  be  computed  utilizing 


I 

1 

1 

( 

S(dbm)  = 10  log  (S/1  milliwatt) 


(2.12) 

(2.13) 


j 


! 

s 
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5.  SIGNAL-TO-NOISE  RATIO  (S/N) 


I 

I 

1 


(2.14) 


PpG2x2o 

(4Ti)3kT^BNT^LR‘* 


This  i-f  or  "single  pulse"  S/N  ratio  does  not  measure  the  over-all 
effectiveness  of  the  radar,  but  serves  as  an  intermediate  step  in  a number 
of  further  calculations. 

Departure  from  free-space  propagation  conditions  must  be  accounted  for 
by  appropriate  components  of  the  total  system  loss  factor  L or  by  modificaticr, 
of  the  antenna  gain  G. 

The  transmitted  power  is  assumed  to  lie  within  the  bandwidth  B of  the 
receiver,  after  reflection  from  the  target  and  propagation  through  the  medium. 

In  the  above  equation  NF  is  the  operating  receiving  system  noise  factor, 

S is  the  signal  strength,  k °is  Boltzmann's  constant,  (1.38  X 1C  “23  v/att  per 
cps  per  degree  Kelvin),  Tq  = 290°  K,  B is  the  equivalent  noise  bandwidth  of  \ 

the  receiver  in  cps.  A more  rigorous  discussion  of  the  above  may  be  found  i 

in  Radar  System  Analysis  by  David  K.  Barton  where  this  section  has  been  ; 

taken  from.  I 


i 


6.  SPHERE  EQUATION 


The  sphere  equation  is  basically  the  radar  range  equation  in  which  the 
units  of  P have  been  converted  from  watts  to  db.  We  will  assume  in  this 
subsection^that  the  values  P^,  G,  X,  and  c in  equation  2.14  are  constant. 
The  assumption  that  o and  aspect  angles  be  constant  at  all  times  means,  for 
all  practical  purposes,  that  a sphere  must  be  the  object  tracked. 

With  the  above  assumptions  equation  2.14  becomes: 

(2.15)  ^r  “ / (4n)3kT^BNF^L 

By  taking  the  log  of  both  sides  of  2.15  we  obtain 


(2.16) 


log  P^  = log  K - 4 log  R 


Multiply  both  sides  of  2.16  by  10 


(2.17) 


10  log  Pj.  = 10  log  K - 40  log  R 


Since  10  log  P^  = P^  (db)  + 10  log  P^^^  for  P^  in  watts  and  P^^^  the 
reference  power,  we  may  substitue  in  2.17  and  get 


(2.18) 


P^(db)  + 10  log  = 10  log  K - 40  log  R 


Pj.(db)  = - 40  log  R + 10  log  K - 10  log  P^^^^ 


But  K and  P^^^  are  constant  and  thus  we  can  make  the  substitution 


b = 10  log  K - 10  log  P^^^ 


And  obtain 


(2.19) 


Pj,(db)  = - 40  log  R + b (The  sphere  equation) 


Thus  the  received  power  in  db  varies  linearly  as  the  log  of  the  range  when 
the  target  being  tracked  is  a sphere  or  has  constant  cross  section. 
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6 . SPHERE  EQUATION  (CONTINUED) 

A BBAR  is  then  computed  to  be  utilized  in  cross-section  equation  later. 
The  equation  is  written; 


(2.20)  F(db)  = 1 ^ ^ [Pr,.(db)  + 40  log  R.] 


Or 


(2.21)  b(db)  = 1 E?  ^ ^ b. 


7 . CROSS-SECTION  EQUATION 


The  problem  v/i th  using  the  range  equation  directly  to  solve  for  radar 
cross  section  is  that  of  detennining  G,  and  While  P and  > arc  not 
difficult  to  calculate,  G is  extremely^hard  to  find  in  a pr§ctical  situation. 
The  reason  is  that  receiver  gain  must  also  be  considered.  Henceforth  G vnll 
be  used  to  mean  "power  gain"  in  the  system.  Also,  it  may  be  possible  on 
some  radars  to  vary  P and  >.  We  may  assume  hov/ever,  that  for  a short  time 
interval,  say  one  or  two  hours,  that  P^,  and  G,  and  x remain  constant.  This 
assumption  means  that  the  radar  operators  must  not  change  these  values  during 
this  time  period. 


In  this  subsection  the  notation  will  be  as  previously  described  but  an 
additional  subscript  t will  mean  that  term  refers  to  the  target,  an  s will  mean 
the  term  refers  to  the  sphere,  e.g.,  P has  been  used  for  received  power, 
thus  Prt  means  power  received  from  the  target  and  P means  power  received 
from  the  sphere. 

To  derive  the  cross  section  equation  we  will  divide  the  range  equation 
for  the  target  by  the  range  equation  for  the  sphere,  then  solve  the  resultant 
for  0*  . It  is  assumed  that  throughout  the  target  and  sphere  tracks  that  P^, 

G,  ana  x remain  the  same.  Thus  we  have: 


(2.22) 


rt 


P G2x2o. 
0 t 


(2.23) 


Therefore 


rs 


P G2x2o 
0 

(4ti)3R^ 


£ 

4 


(2.24) 


The  cross-section  equation  2.26  is  further  changed  by  letting: 


o 


Taking  the  log  of  both  sides,  then  multiplying  both  sides  by  10  and 

......  , X V M 1 „ -7  equation  2.27  we  get 

using  definition  log  -yj  =logX-logY-logZ 

(2.28)  10  log  RFAC  = 10  log  - (10  log  + 40  log  R^) 

Since  X(db)  = 10  log  (—y— — ),  we  may  substitute  in  2.28  and  we  have 

^ref 

RFAC(db)  = log  - [P^(db)  + 40  log  R^] 

Then  utilizing  2.20  we  get 


(2.29)  RFAC(db)  = 10  log  - b 

Takina  the  antiloQ  of  2.29  we  obtain  in  \ratts. 


7 . CROSS-SECTION  EQUATION  (COtiTlNUED) 

Taking  the  anti  log  of  power  received  from  target,  we  have  in  watts, 


(2.31)  = lOO-i 


All  parameters  are  now  in  proper  units  and  are  used  conjunctional ly  to  fonn 
new  cross-section  equation  as  follows: 


(2.32).  o^(m2)  = P^R^‘*RFAC  (The  Cross-section  equation) 


To  obtain  in  db  per  square  meter  (DBSM),  we  use  the  definition 
X 

X(db)  = 10  log  ( ) where  X^^^  = 1 m^ 

^ref 


and  we  get 


(2.33)  o^(dbsm)  = 10  log  (m^) 
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8 . ASPECT  OR  VIEW  ANGLE  ( o) 


The  aspect  or  view  -ingle  denoted  by  the  symbol  theta  is 
the  total  velocity  vector  and  the  slant  range  vector.  This 
defined  as  the  angle  between  unit  vector  along  longitudinal 
and  a unit  vector  along  the  radar  line  of  sight. 


the  angle  between 
can  also  be 
axis  of  target 


miscile  path 


V 

\ 't 


^ 


LAUtiCH  SITE 


-SB  Nfo 


(Figure  2.9) 

The  vector  SR  is  defined  as  follows; 


\ 


V <_ 

N. 


radar  LO? 


RADAR  SITE 


Q 


(2-34) 


SR  = + D + D, 

X y 2 


From  the  distsoce  fcrnula  we  obtain  the  magnitude  of  the  slant  range 
vector  as  follcwt*  ^ 


(2.35) 

(2.36) 
where 

(2.37) 


SR  = jS^ 


SR  . * 0^2  * 


“x  ^ ■ ^rad* 


“y  ■ (''tgt  - Vad' 


“z  ■ (Ztgt  - ^rad) 
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8 . ASPECT  OR  VIEW  ANf.LE  (o)  (CONTINUED) 
The  vector  is  defined  as  follov/s: 

(2.38)  7^  - V +7  +7 

t X y 2 


The  magnitude  of  the  velocity  is  as  follows: 


(2.39) 

Vt=  Y, 

(2.40) 

h - Tv  * V * 

(2.41) 

A-B  = A B cos  a 

A-B 

cos  a = where  o < a < it 

AB  “ “ 

(2.42) 

ifA  = A.+A.  + A, 

ii  2J  3*^ 

and  F = B . + B . + B , 

2J  3^ 

(2.43) 

then  A-B  =AB+AB+AB 

1 1 2 2 3 3 

By  utilizing  the  dot  product  definition  2.43  we  get 


9.  ROOT  MEAN  SQUARE  (RMS)  EQUATION 


! I 

I ; The  root-mean-square  (RMS)  of  the  deviations  from  the  mean  ’s  derived 

f as  follows: 


Let  y = _ 1 (Arithmetic  mean) 

For  large  values  of  n (certainly  n>30) , the  following  equation  is  used 
for  RMS. 

(2.45)  RMS  = 

For  a small  population  (n_<30),  the  following  equation  is  used  for  RMS. 

(2.46)  RMS  = 


For  normal  distribution,  we  will  utilize  the  following. 


y - 2RMS  y - IRMS  y y + IRMS  y + 2RMS 


(Figure  2.10) 

where  one  standard  deviation  on  either  side  of  mean  covers  68.27S  of 
the  area  underneath  the  curve,  ±2  deviations  is  95.45%  and  ±3  deviations  is 
99.73%. 

Root-mean-square  deviation  is  utilized  in  subroutines  RCSLSQ,  RCSCAL, 
and  RCSXSC. 


10.  CURVE  FITTING  AND  THE  METHOD  OF  LEAST  SQUARES 


In  this  derivation,  the  notation  and  wording  has  been  changed  to  conform 
with  other  sections  of  this  appendix. 

Let  us  start  with  a scatter  diagram  in  figure  below 


(Figure  2.11) 

where  x is  a dependent  variable  and  y is  independent. 

From  the  scatter  diagram  we  can  pass  a smooth  curve  which  "fits"  the 
given  set  of  data  as  in  figure  2.12. 


(Figure  2.12) 

We  will  call  R.  the  difference  between  x.  and  the  smooth  curve.  This 
difference  R.  is  called  the  deviation.  It  is  also  sometimes  called  the 
error  or  residual  and  may  be  positive,  negative,  or  zero. 

DEFINITION;  Of  all  smooth  curves  approximating  a given  set  of  data  points, 
the  curve  having  the  property  that 

R2  + r2  + . . . + r2  -js  a minimum  is  called  a best  fitting  curve. 
1 2 n 

The  above  may  be  rewritten  as 


(2.47)  = 1 ~ S where  S is  at  a minimum 

A curve  having  this  property  is  said  to  fit  the  data  in  the  least  square 
sense  and  can  be  linear,  quadratic  (parabolic),  cubic,  or  a fourth  degree 
curve,  or  larger  according  to  method  used.  We  will  derive  the  linear  curve 
below. 


i 
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The  least  square  line  is  derived  as  follows: 
Let  y = Ax  + B 

y = f(x) 

It  follows  that 


R.  = B + Ax.  - y^ 

= f(x^)  - y^ 

Let  S = i"  ^ R? 

(2.48)  S = ij  ^ (B  + Ax.  - y^)2 


Taking  the  derivative  and  setting  the  results  equal  to  zero  we  obtain  the 


following; 

(2.49) 

^=21"  (B  Ax.  - yj  = 0 

6B  i = 1 ^ ^ 

(2.50) 

— = 2 = 1 x^(B  + Ax.  - y.)  = 0 

6A 

Let  us  substitute 


= 1 ’^i  = rx,  and  z y = y ^rom  here  on  in. 
i = 1 i 


Dividing  both  sides  of  equations  (2.49)  and  (2.50)  by  2 and  clarifying  we 
get  the  following  set  of  normal  equations 


(2.51)  Bn  + Atx  - ly  = 0 

(2.52)  BIX  + Aix2  - ly.x^  = 0 


i3 


1C  CURVE  FITTING  AND  THE  MCTIIOn  OF  LEAST  SQUARES  (CONTINUED) 


Solving  simultaneously  and  letting  A = ^ we  got 


(2.53) 


A = ^ = 
C 


ixy  - ^ Ex  sy 


1x2  - Ex  Ex 
n 


the  coefficient  A in  y = Ax  + B,  and 


(2.54) 


B = 


^ (Ey  E x2  -Ex  Exy) 


Ex2  - 1 Ex  EX 


Now  by  substitution  we  may  change  equation  2.54  to  obtain 


^ ly  Ex2  - ^ EX  Exy 


The  above  equation  is  further  manipulated  to 


B a ^x2  _ f zx  EX  Ey  _ EX  EX  EVx  ^Ex  E^X. 


E 


lb.  CURVE  FITTING  AND  THC  METHOD  OF  LEAST  SQUARES  (CONTINUED) 
Now  by  substitution  we  have 

B = JLY..  . ■ 

nC  nC  ' 


which  by  further  clarifying  and  remembering  that  A = ^ 


we  may  alter  to 


(2.55) 


B = 

n 


the  coefficient  B in  y = Ax  + B,  Hence  equation  2.53  and  2.55  are  utilized  in 
subroutines  to  compute  coefficients  A and  B.  Curve  fitting  using  the  method 
of  least  squares  is  utilized  in  RCSLSQ  for  "unskewing"  the  pulse  on  film  due 
to  photographic  process. 


11.  STANDARD  ERROR  OF  ESTIMATE 


If  we  let  represent  the  value  of  y for  given  values  of  x as  estimated 

frijm  y = Ax  + B , a measure  of  the  scatter  about  the  regression  line  of  y on 
X is  given  by 


(2.56) 


I (y  - y„t)2 


S = 


which  is  called  the  standard  error  of  estimate  of  y on  x. 
Squaring  equation  2.56  we  have 


= 


J (y  - y^st)' 


Now  we  substitute  y^^^  = Ax  + B and  obtain 


;2  = 


I (y  - Ax  - B)2 


Expanding  above  we  get 


j2  = _ £[y(y  - Ax  - B)  - Ax(y  - Ax  - B)  - B(y  - Ax  - B)1 


Now  we  clarify  the  above  expression  by  first  applying  the  definition  on 
summations 


r(ax  + by  - cz)  = a rx  + b Ey  - cEz 
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11 . STANDARD  ERROR  OF  ESTIMATE  (CONTINUCD) 

and  then  utilize  the  normal  equations  2.51  and  2.52  to  obtain 


-2  _ ryz  - Azxy  - Bry 
* n 


Taking  the  square  root  we  finally  have 


(2.57) 


s 


Zy^  - Azxy  - Bly 
n 


(The  standard  error  of 
estimate  of  y on  x) 


The  coefficients  A and  B used  in  equation  2.57  are  obtained  from  equati 
2.53  and  2.55.  The  standard  error  of  estimate  is  utilized  in  RCSLSQ  and 
RCSCLl  for  computing  threshold  value. 


12.  DENSITY  OF  A CHAFF  C1.0UD 


The  Density  of  a Chaff  Cloud  is  defined 
per  volume.  The  radar  cross-section  ) 


here  as  the  radar  crKSS-section 
is  as  shovm  in  equation  (2.32). 


(Figure  2.13) 

The  beamwidth  ©5  is  approximated  utilizing  equation  (2.5)  for  circular 
aperture  where  the  frequency  of  the  radar  and  the  diameter  of  the  reflec- 
tor are  taken  into  consideration. 

The  constant  for  depth  penetration  is  the  transmitted  pulse  duration  T 
in  microseconds  converted  to  kilometers  as  described  in  the  glossary. 

The  area  of  a circle  is  nr^.  The  radius  r of  the  circle  was  found  by 
using  the  following  equation  where  SR  is  tne  Slant  Range  in  kilometers. 

(2.i>8)  r = SR  sinf^ 

The  volume  V is  defined  as  the  area  of  the  circle  multiplied  by  the 
pulse  duration,  tau,and  the  units  are  kilometers  cubed. 

(2.59)  V ='irr2T 
The  density  D is  then 

(2.60) 

where  the  units  of  measurement  are  RCS(meters  squared )/Volume( kilometers 
cubed ) . 

This  computation  is  incorporated  into  subroutine  RCSXSC  only. 
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All  deck  setups  have  been  standardized  betv/een  each  subroutine  for 
conformity  within  itself  and  with  other  data  reduction  subsystems. 


The  mod  vie  may  be  utilized  to  process  one  subroutine  by  itself  or 
to  process  a multi-call  "stacked  job"  in  a computer  pass  or  run. 

Flexibility  is  built  into  the  subsystem  for  future  expansion  or  grov/th. 
This  can  be  accomplished  by  foilowin.5  the  technique  presently  used  in  this 
system  for  addition  of  new  subroutines  to  meet  additior.al  VISt/R  data  reduc- 
tion requirements. 

The  author  wishes  to  acknowledge  all  contributions  to  the  "state  of 
the  art"  of  radar  cross-section  data  software  programs  and  technical  reports 
by  various  people  throughout  the  years,  especially  Darold  W.  Comstock, 

Richard  H.  Dale,  Eugene  H.  Dirk,  Jr.,  and  Leonard  D.  Erickson,  in  NR-A, 

White  Sands  Mssile  Range,  and  Graham  Hall,  Harlan  F.  Lerum,  J.V.  .V.igliorato 
and  Neil  E.  Feichtner  at  the  Air  Force  Ballistic  RE-entry  System/Ealli stic 
Missile  Research  System  (AERES/E'.'RS),  Data  Center  at  Holloman  Air  Force 
Base.  The  list  is  not  necessarily  complete,  and  the  author  apologizes  for 
any  omissions. 

Due  to  the  dynamic  aspects  and  continual  evolutionary  process  in- 
volved in  data  reduction,  the  "state  of  the  art"  of  radar  cross-section  data 
is  continually  being  improved  upon  because  of  advances  in  computers  and 
computer  technology,  scientific  equipment,  and  the  discovery  of  better  data 
reduction  and  programming  techniques;  hence  suggestions,  ideas,  contributions, 
or  valuable  criticism  are  welcome,  and  will  be  incorporated  in  future  versions. 


A simple  definition  of  radar  cross-section  data  is  the  size  of  an  object  1 

as  it  appears  to  a radar  regardless  of  its  actual  size.  There  is  no  simple  j 

relationship  between  the  actual  size  and  the  radar  cross-section.  One  of  the  j 

methods  used  to  find  this  relationship  is  by  the  use  of  scale  models  in  an  j 

indoor  test  range  called  radar  anechoic  chambers.  The  models  are  place  on  | 

turn  tables  so  that  various  aspect  (view)  angles  may  be  obtained.  Another  1 

method  is  by  use  of  sphere  drops  or  sphere  raises  by  balloon  as  a method  of  | 

comparison.  The  sphere  technique  is  the  method  used  in  this  report.  Radar 
cross-section  can  then  be  defined  as  the  cross-section  area  of  a perfectly 

conducting  sphere  at  the  same  range  as  the  target  which  would  return  the  | 

same  power  as  the  taraet.  Normally,  a tethered  sohere  is  lofted  on  a 

ball  oon  or  a sphere  is  dropped  from  an  aircraft  after  a mission  has  been  1 

completed.  The  power  return  bounced  back  and  the  slant  range  is  measured  j 

and  recorded  for  sphere  and  this  becomes  the  BBAR  value  in  the  reduction  1 


of  mission  data.  Radar  cross-section  is  then  computed  from  BBAR,  power  return 
from  mission,  slant  range  from  radar  trajectory  data,  the  radar  theoretical 
back-scattering  cross-section  for  a perfectly  conducting  sphere  of  known 
size  and  frequency,  and  the  radar  equation.  The  RCS165  nodule  utilii^es 
the  fact  that  radar  cross-section  depends  on  radar  frequency,  the  angle  at 
which  beam  strikes  target,  the  polarization  of  the  signal,  plus  many  other 
radar  constants  and  variables  unique  to  each  radar  such  as  RAN,  RANPART , 

FPS16,  TTR,  HAPDAR,  RTMS,  DR,  and  KAR.  The  flexibility  to  handle  above 
mentioned  radars  is  built  into  the  subroutines  comprising  this  monitor. 
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B . Logical  Procedure  (CONTINUED) 


There  are  three  distinct  types  of  data  that  are  handled  by  RCS165 
For  better  clarity,  each  one  will  be  discussed  separately. 

The  first  is  A-Scope.  A-Scopc  is  a type  of  data  presentation  where  the 
echo  strength  or  target  amplitude  is  indicated  by  vertical  displacement 
(Y-axis)  of  the  luminous  spot  and  range  by  horizontal  (X-axis)  displacement 
on  a cathode-ray  tube.  There  is  no  angle  information  displayed  on  this 
type  of  data  presentation.  The  radar  signal  or  burst  of  energy  is  bounced 
off  the  target  and  optical  cameras  are  setup  and  calibration  procedures  are 
followed  for  A-Scope  recording  film.  The  data  is  recorded  on  35  mm  film 
in  three  ways;  Type  1 film  is  video  pulse  streak  camera  recording  (strip 
filni).  Type  II  is  framing  camera  recording,  and  Type  III  film  is  intensity 
modulation  streak  camera  recording.  The  timing  on  film  is  IRIG  timing. 

This  technical  report  covers  the  data  reduction  of  strip  and  frame  film  data. 
The  film  is  assessed  and  then  read  on  our  PFR-3  A-Scope  Film  Reader  System. 

The  A-Scope  Film  Reader  System  (APRS)  was  designed  by  Lincoln  Laboratory, 
Massachusetts  Institute  of  Technology  as  a part  of  an  Advanced  Research 
Project  Agency  (ARPA)  project  and  hyiit  by  Information  International  Incor- 
porated, Cambridge,  Massachusetts  to  read  film  on  a semi-automatic  basis. 

The  APR  System  utilizes  the  Procranmed  Data  Processor  (PDP-1)  comouter  to 
operate  and  control  input-output  devices.  A-Scope  type  data  is  primarily 
utilized  v.hen  target  discrimination,  individual  RCS  for  entire  pulse,  peak 
RCS,  or  other  signature  analysis  is  required. 

Tlie  second  type  is  the  Automatic  Gain  Control  (AGC)  type.  The  digitized 
AGC  data  recorded  on  radar  field  tape  is  pro-calibrated  on  the  boresight  tower 
in  5 do  steps  from  0 db  to  noise  level.  This  data  is  then  stored  in  the  Milgo 
(ARCADE)  Computer  memory  and  during  mission  the  target  video  is  referenced 
to  stored  AGC  precal . ARCADE  is  an  acbrevi atior  for  Automatic  Radar  Control 
and  Data  Equipment.  The  reference  AGC  level  is  t.hen  recorded  on  radar 
field  tape.  The  digitized  raw  data  is  produced  as  a positive  displacement 
above  the  noise  level.  Each  radar  launches  their  own  spheres  and  conducts 
the  sphere  calibrations  immediately  after  mission  completion.  This  type  of 
data  is  normally  converted  to  peak  RCS. 

The  third  type  is  the  Automatic  Gain  Control  - Voltage  Control  Oscillator 
(AGC-VCO).  The  detected  video  is  recorded  on  analog  tape  by  a telemetry  van 
station  utilizing  FH  methods.  The  data  is  later  digitized  to  machine  counts 
and  from  there  to  engineering  units  of  db  through  the  use  of  pre  or  post 
step  calibrations.  This  type  of  data  is  normally  converted  to  peak  RCS, 

All  three  types  utilize  sphere  calibrations.  The  third  type  utilizing 
telemetry  is  used  very  little  for  obtaining  peak  RCS  as  type  two  is  faster. 
Type  three  is  utilized  more  for  radar  system  analysis  work. 
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B. 


Logical  Procedures  (CONTINUED) 


Radar  signature  data  and  radar  cross-section  data  are  two  terms  that  are 
synonymous  and  are  used  interchar.geaoly  in  this  report. 

Radar  Cross-section  is  often  measured  in  DBSM  (db  per  meters  squared)  which 
is  plotted  as  shewn  in  figure  1.1.  Analysis  of  this  pattern  would  indicate 
shape  of  vehicle,  the  number  of  protrusions,  whether  target  is  spinning  and  if 
so  at  what  rate,  plus  other  information  according  to  needs  of  customer.  RCS 
is  often  plotted  against  time,  altitude,  aspect  angle,  and  slant  range. 

Figure  1.2  shows  the  characteristics  of  instrumentation  Radar  A!l/r?S16  and 
Figure  1.3  is  a map  of  the  "on  range"  radar  sensor  instrumentation  sites  for 
FPS16,  chain,  surveillance,  and  project  radars. 

Target  discrimination  and  penetration  aid  techniques  are  but  two  of  the 
many  areas  dependent  on  the  output  of  a program  like  RCS165.  One  of  the 
fields  where  analysis  of  RCS  data  is  required  is  in  re-entry  body  studies. 

As  a vehicle  re-enters  the  atmosphere,  the  shock  waves  formed  cause  a layer 
of  electrons  on  the  body  of  the  vehicle.  This  plasma  sheath  causes  a drastic 
change  in  cross-section  as  compared  to  free  space  cross-section.  Tlie  ionized 
field  or  wake  which  follows  the  vehicle  also  causes  a reduction  or  enlargement 
of  cross-section.  The  study  of  this  phenomena  in  signature  data  analysis  can 
be  utilized  in  the  development  of  anti  radar  signature  decoys  or  in  the  determin 
atior,  of  enemy  warheads  in  a mass  of  "e-entry  vehicles  by  cross-section  data 
predictions.  Its  measurements  (the  recognition  of  shaoes  such  as  conical, 
cylindrical , spherical,  or  any  geometrical  combination)  and  numerous  other 
techniques  are  now  being  refined  by  3ignatu>'e  Data  Analyst  contractors  through- 
out the  nation.  The  Signature  Analyst  can  then  be  expected  to  come  up  with  a 
reasonable  approximation  of  the  unknown  body  by  kno'wing  the  characteristic 
returns  of  certain  bodies  and  the  recognition  and  cataloging  of  different 
shapes . 

Since  warhead  must  be  identified  in  time  for  defensive  action,  a 
computer  is  necessary.  A computer,  however,  tends  to  take  things  too  literally 
if  a return  differs  sligntly  from  the  description  which  is  given  to  it,  the 
computer  will  not  recognize  the  object.  Sut  with  new  computers  which  are 
capable  of  learning  and  with  i.mprovsd  optical  technique  for  pattern 
recognition,  perhaps  this  problem  is  closer  to  a solution. 

Another  big  area  where  RCS  Data  Analysis  plays  an  important  role  is 
in  Electronic  Countemieasures  (ECM)  studies.  ECM  embraces  such  techniques 
as  jamming  transmitters,  set-on  receivers,  false  target  repeater,  simula- 
tion of  different  aircraft  radar  signatures  by  the  use  of  decoys,  chaff 
clouds  cut  to  an  appropriate  size  to  resonate  at  the  frequency  of  tlie  radar 
to  be  jauined,  and  other  techniques  by  which  the  enemy  is  denied  the  use  of 
the  electromagnetic  spectrum  for  radar,  conmuni cati ons , navigation  and 
guidance. 


B,  Locicq]  Proceduror  (COl'IllJUKD) 


The  future  of  RCS  i?;  very  bright  incieed.  The  resolution  capability 
of  radar  is  constantly  being  upgraded.  With  the  use  of  synthetic-spectrum 
radar,  chirped  radar,  phased-array  systems,  etc,,  the  resolution  should 
improve  to  the  point  where  the  image  of  a distant  target  will  correspond  more 
to  its  physical  than  its  electrical  features. 


The  behavior  of  wake  and  plasma  phenomena  is  becoming  better 
stood  as  more  advances  are  made.  Coupled  with  new  approaches  to 
printout,  this  should  provide  displays  of  greater  validity.  The 
field  of  RCS  in  re-entry  physics,  radar  profiling,  passive  radar 
and  large-scale  air  traffic  control  has  barely  been  opened, 


under- 
computer 
enti re 
detection , 


FOR  FP3  16 'C- BAND  RADAR 


:CT  OR  VIEW  ARGUE  IN  DEGREES 
{FIGURE  1. 1> 


TABLE  OF  CHARACTERISTICS  OF  INSTHUUrHTATlOfl  RADAR  AN/FPS-16 


System  Noise  Figure 

I.F.  Center  Frequency 

Banciwidths 

Local  Oscillators 

AFC 

lion-Tracking  I.F. 


RECEIVER  SrSTCM 

- < 11  db 

- 30  MC 

- Wide  8.0  MC.  Narrenv  1.6  MC 

- Two  - Skin  anil  Coacon 

- Skin  or  Beacon 

- flon-galed  manually  gain  controlled  receiver, 
video  added  to  tracking  video  for  display  only 


RANGE  TRACKING  SVSTEM 


Range 

Gate  Widths 


Aided  Tracking 
Maximum  Slew  Rate 
Maximum  Tracking  Rate, 
Automatic  Lock-On 
Servo  Bandwidth 


Range  Accuracy 


- 1 ,000,000  yards 

- Pulse  widths  0.25  js,  0.5  us,  1.0  us 
Acquisition  I .0  us , 1.25  vS,  1.75  us 
Tracking  e.5  ;.S,  0.75  uS,  1.25  .s 

- Yes 

- 40,000  yard;  per  second 

- 12,000  yards  per  second 

- Search  ±1000  yards,  and  auto  lock 

- Continuously  adjustable  manually  or  autoi'jf! ral ly 
between  0.5  cos  (K  = 2000)  and  0.0  cps 

- .'3.5  la  .'is  yards 


ANGLE  TRACKING  SYSTEM 


Aided  Tracking 
H.iximum  Slew  Rate 
Maximum  Tracking  Rate 
Servo  Bandwidth 


Scan 

Angle  Accuracy 


- Yes 

- Azimuth  48°  per  second.  Elevation  37"  per  second 

- Azimuth  42°  per  second,  Llevation  22.5°  per  second 

- Continuously  adjustable  manually  or  automatically 
between  0.5  cps  (K  = 150)  and  4.0  cps  (K^  ■=  300) 

- Circle  scan  of  adjustable  radius  and  rate.  Also 
sector  scan  in  azimuth  and  elevation 

- +0.05  mi  1 to  ±0.3  mi  Is 


Potentiometer 

Sycro 

Digital 


DATA  OUTPUTS 


- Army  or  Navy  speeds 

- Serial  straight  binary 

Range  - 20  bits  (0.5  yard  quanta) 
Angle  - 17  bits  ( < .05  mils  quanta) 


(Figure  1.2) 


TABLE  OF  CHARACTERISTICS  OF  INSTRUMCHTATIOH  RADAR  AN/FPS-16  2 » 


ANTENNA  PEDESTAL 


Axes 

Weight 

Plunging  Capability 
Azimuth  Coverage 
Elevation  Coverage 


- Azimuth  and  Elevation 

- 12,000  pounds 

- Yes 

- Continuous  360° 

- 10°  to  190° 


ANTENNA  SYSTEM 


Size 

Feed 

Gain 

Beamwidth  03  db  pts 
Polarization 

Rotary  Joints  and  Waveguide 


- 12  foot  diameter  parabola 

- 4 - Horn  Honopulse 
-44.5  db 

-1.2  degrees 

- Vertical 

- 3 Megawatt  capability 


TRANSMITTER 

TYPE  - MAGNETRON 

Peak  pQv/er  Frequency  Max.  Duty  Cycle 
R-113.  R-i23,  R-127  3MW  5450-5825  MC  0.001 


Eight  Other  FPS16's  IMW  Fixed  Tuned  5480  i 35HC  0.0010 
250  KW  Tunable  5450  - 5825MC  0.0016 


PRF  - Internal : 
Internal : 
External : 


3HW-12  steps  between  142  to  1707  PPS 
lMW-18  steps  between  71  to  1707  PPS 
Any  PRF  between  160  to  1707  PPS 


Pulse  Width  - 0.25  ys,  0.5uS  and  I.Ops 

Coding  - Up  to  5 pulses  within  Duty  Cycle  Limitations 


(Figure  1.2) 
CONTINUED 


DISPLAYS 


Range  - Dual  A-Scope 

Dials 

Digital-Octal  numeral 
Angle  • Dials 

Digital-Octal  numeral 


MONITORING 

Noise  Figure  Measurement 

Transmitter  Power 

Range  and  Angle  Servo  Performance 

Significant  Wavefems 

Significant  Voltages  and  Currents 

Strip  Chart  Recorder  and  Signal  Patch  Panel 


USE 

Provides  real-time  infon’ai’on  to  the  Missile  Flight  Safety  Officer, 
liajectory,  A-Scope,  AGC  doLa  to  t.' ■:  project,  acquisition  > to  oi*-'”- 
range  data  gathering  systems,  and  ■■■■ : te'^ing  data  for  drones  and  target 
aircraft,  transmit  digital  data  In  real  time. 
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A NONUNEAR  SINGULARLY  PERTURBED  VOLTERRA  INTEGRODIEFERENTIAL 
EQUATION  OCCURRING  IN  POLYMER  RHEOLOGY 


A.S.  Lodge  J.B.  McLeod  and  J.A.  Nohel^^' 

Mathematics  Research  Center,  University  of  Wisconsin,  Madison , Wisconsin 

ABSTRACT 

We  study  the  initial  value  problem  for  the  nonlinear  Volterra 
Integrodifferential  equation 

t 

r - iiy'(t)  = / a(t-s)F(y(t),  y(s))ds  (t>0) 

( + ) -s 

^ y (t)  = g (t)  (-co<  t < 0 ), 

where  ji  > 0 is  a small  parameter,  a is  a given  real  kernel,  and  F,  g 
are  given  real  functions;  {+  ) models  the  elongation  ratio  of  a homogeneous 
filament  of  a certain  polyethylene  which  is  stretched  on  the  time  interval 
(-0C,  O],  then  released  and  allowed  to  undergo  elastic  recovery  for  t>  0. 
Under  assumptions  which  include  physically  interesting  cases  of  the  given 
functions  a,  F,  g,  we  discuss  qualitative  properties  of  the  solution  of  ( + ) 
and  of  the  corresponding  reduced  problem  when  = 0,  and  the  relation 
between  them  as  |i  —0^,  both  for  t near  zero  (where  a boundary  layer 
occurs)  and  fcr  large  t.  In  particular,  we  show  that  in  general  the 
filament  does  not  recover  its  original  length,  and  that  the  Newtonian  term 
-p.y*  in  (+)  has  little  effect  on  the  ultimate  recovery  but  significant 
effect  during  the  early  part  of  the  recovery. 


Sponsored  by: 

The  United  States  Army  under  Contract  Number  DAAG29-73-C-0024; 
National  Science  Foundation  under  Grant  Number  ENG  75-18397; 
^^The  United  States  Army  under  Grant  Number  DAHC04-74-G-0012; 
^^National  Science  Foundation  under  Grant  Number  MCS  75-21868. 


1 . Introduction . 


We  study  the  nonlinear  Volterra  Integrodlfferentlal  equation 

t 

(1.1)  -HLy'(t)  = / a (t-s)  F (y(t),  y(s))ds  (t>0;  ' = d/dt) 

-00 

subject  to  the  initial  condition 


(1*2)  y (t)  = g(t)  (-oo< t £ 0)  . 

This  initial  value  problem  arises  as  a mathematical  model  for  a process 
in  polymer  rheology  which  is  described  in  Appendix  A.  In  the  specific 
problem  discussed  there  p is  a positive  parameter  related  to  the  viscosity, 
and  the  given  real  functions  a,F,  g take  the  forms 

m 

(1.3)  a(t)  = Yj  (- tAj, ) 

k=  1 

where  aj^  and  are  positive  constants, 

(1-4)  F(y,  z)  = yVz^-z  , 

and 


g(t)  = 


1 if  -00<t<-tQ  (1jj>0) 

e if  -tp  < t < 0 , 


where  ^ is  a positive  constant  (see  equations  (A5),  (A  Zl),  (A  24),  (A25) 
in  Appendix  A).  The  unknown  function  y measures  the  ratio  of  the 
extended  length  to  the  original  length  of  a homogeneous  filament  which 
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Is  stretched  In  accordance  with  (1.2)  on  the  time  Interval  ( -«c,  O]  and 
then  released.  The  equation  (1.1)  then  describes  the  process  of  elastic 
recovery.  Also  of  interest,  both  mathematically  and  physically,  is  the 
reduced  equation 

(1.6)  0=  J a(t-s)  F (y(t),  y(s))ds  (t>0), 

-00 

with  y(t)  = g(t)  on  (-oc,  0),  and  in  particular  the  relation  between 
I the  solutions  of  (1.1),  (1.2)  and  of  (1.6)  as  — 0^;  for  small  n > 0, 

(1.1),  (1.2)  may  be  regarded  as  a singular  perturbation  of  (1.6). 

The  purpose  of  this  paper  is  to  discuss  the  qualitative  behaviour  of 
solutions  of  (1.1),  (1.2)  on  the  one  hand  and  of  (1.6)  on  the  other,  and 
the  relation  between  them  as  — 0^.  Our  analysis  is  not  confined  to  the 
specific  forms  of  a,  F,  g in  (1.3)  - (1.5),  but  rather  we  abstract  the 
essential  properties  of  these  functions.  Two  results  of  particular  interest 
are  : (i)  in  general,  the  filament  does  not  return  to  its  original  length,  as 
confirmed  by  experiments  (see  Appendix  A and  Theorems  4 and  5); 

(ii)  similarly  to  behaviour  in  singular  perturbation  problems  for  ordinary 
differential  equations,  the  solution  of  (1-1),  (1.2)  for  small  p.  > 0 

decreases  rapidly  as  t Increases  near  t = 0 to  the  solution  of  (1.6), 
becoming  close  to  it  in  a "boundary  layer"  time  interval  of  order  | log  1 , 
and  thereafter  remains  close  (Corollary  3.1  and  Theorems  7 and  8).  The 


Introduction  of  the  Newtonian  term  -jiy'  in  (1.1)  has  little  effect  on  the 
ultimate  behaviour  of  the  solution . 

From  the  mathematical  point  of  view  the  theory'  of  integrodifferential 


equations  with  decreasing  convex  kernels  and  with  nonlinearities  consisting 
of  functions  of  one  variable  is  well  known,  see  e.g.  [7],  [18]  where 
references  to  other  literature  are  given.  A part  of  the  novelty  of  the 
present  analysis  is  that  F in  (1.1)  and  (1.6)  is  a function  of  two 
variables,  y(t)  and  y(s). 

We  acknowledge  with  pleasure  the  various  helpful  discussions  with 
colleagues  during  the  preparation  of  this  paper,  in  particular  with  M.G. 
Crandall,  R.W.  Dickey,  F.  Howes,  J.J.  Levin,  S.O.  Londen,  S.V.  Farter, 
A.  Pazy,  D.F.  Shea,  O.J.  Staffans,  L.  Tartar,  and  W.  Wasow. 


2.  Statement  of  Results. 

Let  R denote  the  real  numbers,  the  positive  real  numbers, 
and  C the  set  of  k times  continuously  differentiable  functions. 

We  make  the  following  assumptions  on  the  functions  a,  F,  g 
throughout; 

{a  e [0,oo);  a(t)  > 0,  a«(t)  <0  (0  < t < «)  ; 

a e L^  (0, oo);  log  a(t)  is  convex  (0  :<t<oo),  i.e. 

a*(t)/a(t)  is  nondecreasing ; 
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It  is  readily  verified  that  the  specific  functions  defined  in  (1.3)  - (1.5) 

satisfy  (H^),  (H„)  and  (H  ) respectively.  While  for  the  majority  of 
at  g 

results  these  are  the  only  conditions  required,  some  additional  assumptions 
are  needed  in  Theorems  4 and  5,  Corollaries  2.1  and  3.1,  and  Theorems  6 ai 
The  first  result  concerns  the  global  existence  and  uniqueness  of 
the  solution  of  (1.1),  (1.2)  for  a fixed  p > 0 and  gives  some  useful 
properties  of  the  solution. 

Theorem  1.  Let  (H^),  (Hp),  (H^)  be  satisfied . Then  for  each  p > 0, 
the  initial  value  problem  (1.1),  (1.2)  has  a unique  solution  4)(t,  p)  on 
[0,  co)  satisfying  the  following  properties: 

<j.’(t,p)<0  ^ 1 < <i>(t,p)  < g(0)  (0<t<co); 

if  9^92  satisfy  (Hg)  and  if  g^ft)  > g^ft)  (-oo  < t < 0), 
then  the  corresponding  solutions  (*)j(t,  p),  <})^(t,  p)  ^ (l-l), 

(1.2)  satisfy  <{)j(t,  p ) s 4>^(t,p)  (0st<oo); 

if  pj>  p^,  then  the  corresponding  solutions  of  (1.1),  (1.2) 
satisfy  <}>(t,p^)  > <))(t,p^)  (0<t<oc). 

An  Immediate  consequence  of  (2.1),  (2.3)  is 

Corollary  1.1.  Let  (H^),  (Hp),  (H^)  be  satisfied.  Then  for  each  fixed 
p > 0 one  has 

(2.4)  a{p)  = lim  <t>(t,p)  exists  and  a(p)  2:  1 ; 

t —CO 
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moreover.  If 

Theorem  1 is  proved  in  Section  3 . 

Remark  1.1.  In  the  specirl  case  a (t)  = Ae  , A>  0,  X>  0 which 

satisfies  (H  ) (with  = -X),  conclusions  (2.1)  and  (2.4)  can 

a a il) 

be  strengthened  respectively  to  the  following : 

(2.1')  «|)'(t,p)  < 0 and  K Yq  < <}>(t,^i)  ^9(0}  (0st<co), 

(2.4')  a(^l)  = lim  4)(t,iJL)  exists  and  a(^i)  > y > 1, 

t -*eo 

where  (see  the  proof  of  (2.8)  in  Theorem  3)  y^  is  uniquely  defined  by 
the  equation 

0 

/ e^^  F(yQ,  g(s)  ) ds  = 0 . 

-00 

The  proof  of  Remark  1.1  is  carried  out  in  the  course  of  proving  (2.1) 

( Case  (ii)  ). 

Remark  1.2.  If  one  is  interested  only  in  global  existence,  rather  than 
further  properties  of  solutions  of  (1.1),  (1.2),  then  the  following  existence 
result  can  readily  be  established. 

Proposition  1.  Let  a e L^(0,  oo),  a(t)'>  0 (0  st<oo);  let  g 
satisfy  (Hg);  l^t  F e C (■jit'’'x  IR"^  ) , F(x,  x)  = 0 for  every  x e R'*’ 
and  F(y,  z)  < 0 for  y < z and  F(y,  z)  > 0 _for  y>  z(0<y,  z<oo). 
Then  for  each  fixed  p>  0 the  initial  value  problem  (1.1),  (1.2)  has 
at  least  one  solution  on  0 < t < « . 
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The  proof  of  this  result  follows  from  the  following  observations;  (i)  under 
the  present  hypotheses  (1.1),  (1.2)  has  a local  solution  on  0 s t < t^ 
(see  e.g.  [17,  Lemma  1.1],  also  [16]);  (ii)  if  y(t)  is  any  solution  of 
(1.1),  (1.2)  on  0 s t < oc,  then  a slight  modification  of  the  early  part 
of  the  proof  of  Theorem  1 shows  that  1 < y(t)  s g(0)  (0<t<oo); 

(iii)  in  view  of  (i),  (ii),  every  such  local  solution  can  be  extended  (but 
not  necessarily  uniquely)  to  the  Interval  [O, «)  (see  e.g.  [17,  Lemma 
1.2],  also  [16]). 

We  next  obtain  some  estimates  of  the  solution  <}>(t,p)  of  (1.1), 
(1*2)  which  are  useful  for  the  study  of  the  asymptotic  behaviour  as 
t — CO,  and  which  will  be  used  to  deduce  the  existence  of  a solution  of 
the  reduced  problem  (1.6). 

Theorem  2.  Let  (H_),  (H^.),  (H_)  be  satisfied.  Then  there  exists  a 

a r g — 

constant  > 0 (independent  of  ) and  constants  p ^ > 0 and 
K = K(pp)  > 0 such  that  the  solution  <})(t,ji)  of  (1.1),  (1.2)  satisfies 
the  estimate 

O' 

(2.5)  0 < -<t.'(t,p)  < — exp(-Kjt/|ji)  + K/a(s)ds  ( 0 <t<oo  ; 0 <p  < ) . 

t 

As  a consequence  of  Theorem  2 and  the  logarithmic  convexity  of  a,  one 
obtains 

Corollary'  2 . 1 . _If , ^ addition. 

A “ 

(2.6)  J ta(t)dt  < 00 

0 
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U satisfied,  then  there  exist  constants  > 0 and  K = K ) > 0 
such  that 

(2.7)  0 < - a(jt)  s K j(s-t)a{s)ds  ( 0 s t<ce;  0<n  s ) , 

t 

where  ot(^i)  = Hm  4)(t,^l). 

t -►« 

Theorem  2 and  Corollary  2.1  are  proved  in  Section  4. 

Concerning  the  reduced  problem  (1.6)  we  prove  the  following  global 

results . 


Theorem  3.  Let  (H_).  (H„).  (H  ) be  satisfied.  Then  (1.6)  has  a 
a t g 

unique  (continuous)  solution  <})^  ^ [0,  oo)  satisfying  the  following 

properties ; 


(2.8)  if  then  1 < y^  < g (0)  ; 

iL  a(t)  i Ae"  , A > 0,  X > 0 , then 

(2.9)  <|>p  € C^[0,oo),  «).(;(t)<  0 and  (0st<oc); 

if  a (t)  £=  Ae"^^  , A > 0,  X > 0,  then  4)Q(t)  = y^^  ( 0 s t < oc); 


11  91,92  satisfy  (H  ) and  ^ g^ft)  > g^lt)  (-oo<t:s0). 


(2.10)  I then  the  corresponding  solutions  4> 
<t,jj>(t)  > (t)  (0  :£t<oo)  ; 


(1)  J2) 


4>'  ' of  (1.6)  satisfy 


(2.11) 


11  «t>(t.ii) 

and  11  <J)Q(t) 

( 0 S t < eo). 


is  the  solution  of  (1.1),  (1.2)  for  a fixed  ji>  0, 
is  the  solution  of  (1.6),  then  <|>Q(t)  < <}>(t,p) 
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As  a consequence  of  (2.4),  (2.9),  (2.11)  we  have  the  first  statement 
Corollary  3.1.  Let  (H^).  (H_).  (H^ ) be  satisfied.  Then 

ary 

(2.12)  a = lim  exists  and  1 < a s a(  ^ ) ; 

u ^ u u 


If  a(t)  H Ae“^'  , A > 0,  X > o,  then,  in  fact,  > 1 . If  also 

(2.6)  holds,  then 


(2.13) 


lim  a(^JL)  = 


(2.14)  0<<i>(t,vv)-4>o(t)  ^a(ji)-a^+K /(s-t)  a(s)ds  (0st<co; 


Theorem  3 and  (2.13),  (2.14)  are  proved  in  Section  5.  In  Theorem  8 
below  we  establish  a more  precise  result  than  (2.13),  (2.14)  under  some 
additional  assumptions. 


Remark  3.1.  An  estimate  similar  to  (2.5)  holds  for  the  solution  «}>q  of 
the  reduced  problem  (1.6),  (but,  of  course,  without  the  term  — exp(-K^t/^)). 
This  can  be  proved  as  a special  case  of  the  proof  of  Theorem  2 by  obtaining 
an  estimate  of  the  form  (4.6)  in  Section  4,  where  f is  now  independent 
of  fi,  and  using  it  and  an  estimate  of  the  form  (4.4)  (also  now  independent 
of  ) in  (5.12)  of  Section  5.  Consequently,  (2.7)  also  holds  with 
4>(t,^)  replaced  by  4>Q(f)  “(M')  and  with  0<t<oo. 
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The  next  task  is  to  establish  the  physically  Important  fact  that  the 


limiting  value  a(p,)  of  the  solution  <j>(t,p)  of  (1.1),  (1.2)  as  t oc 

satisfies  oi{p)  > 1 (p>  0),  rather  than  the  weak  form  ^ 1 in 

(2.4).  By  properties  (2.11),  (2.12)  it  suffices  to  prove  *^0  > 1 

''  * 

(a^  = 11m  <|>jj(t)).  ' 

t — 00 

Theorem  4.  Let  (H^),  (Hp  ),  (H^)  be  satisfied.  If,  in  addition.  i 

(2.6)  ^ satisfied,  then 

(2.15)  Op  > 1 . 

We  remark  that  in  the  special  case  a (t)  = Ae~^^  , A>0,  X > o,  there 
is  nothing  to  prove  since  4)^  (t)  s y^  > 1 . Theorem  4 is  proved  in 
Section  6. 

Theorem  4 is  best  possible  in  the  following  sense. 

Theorem  5.  Let  (H  ).  (H_)  be  satisfied  and  let 

a g 

(2.16)  F(y,  z)  = y - z . 

IL 

(2.17)  r sa(s)  ds  = 00  , 

0 


Remark  5.1.  F(y,  2)  = F^Cy-z),  where  F^  > 0 Is  a constant,  ls,being 
linear,  the  simplest  form  of  F coisslstent  with  assumptions  (lip  ) . 

Remark  5.2.  A similar  proof  applied  to  (l-l),  (1*2)  shows  that  if  the 

hypotheses  of  Theorem  5 hold,  then  lim  <j>{t,  p)  = 1,  where  <|>(t,  p) 

t -'00 

Is  the  solution  of  (1.1),  (1.2)  with  F satisfying  (2.16). 

Theorem  5 is  proved  in  Section  7 . 

In  connection  with  the  last  statement  In  Remark  3.1  It  is  of 
Interest  to  note  that  such  an  estimate  for  <})q  , (2.7),  can  be  proved  inde- 
pendently of  Theorem  2,  and  we  state  this  fact  as  a separate  result  in 
Theorem  6.  However,  it  should  also  be  noted  that  the  estimate  (2.5) 

(for  ) described  in  Remark  3.1  cannot  be  obtained  from  Theorem  6. 

Theorem  6.  Let  (H^),  (Hp),  (H^)  be  satisfied,  and  lot  (2.6)  hold.  Then 
there  exists  a constant  K > 0 such  that , 1£  a(t)  ^ A e~^^  , A > 0,  X > 0 , 

(2.19)  0 < I (s  - t)a(s)ds  (0  < t < «)  . 

Theorem  6 is  proved  in  Section  8. 

Remark  6.1.  One  can  also  prove  the  estimate  (2.7  ) for  the  solution  4;(t,p) 
of  (1.1),  (1.2)  in  the  manner  of  Theorem  6,  without  using  (2.5  ). 

Our  next  task  is  to  establish  the  existence  of  a boundary  layer  in  a 
neighbourhood  of  t = 0 as  p -*  0^.  For  this  purpose  we  consider  the 
following  approximation  of  the  problem  (1.1),  (1.  2)  for  small  t > 0; 

0 

(2.  20)  -pv'(t)  = / a(-s)r(v(l),g(s))ds  (t  > 0;  v(0)  = g(0))  . 

-00 
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It  will  be  observed  tliol  (2.  20)  is  not  a Vollerra  equation,  but  acts  rather 
like  an  ordinary  differential  equation.  Performing  the  stretching  transformation 


(2.  21)  t = jxT 

1, 

and  setting  w{t)  = v(t)  transforms  (2.  20)  to 
. 0 

(2.22)  " ” / a(-s)F(w(T),g(s))ds  (t  > 0;  w(0)  = g(0))  , 

-00 

Theorem  7 . Let  (H  ),  {H„),  (H  ) be  satisfied.  Then  the  initial  value 
problem  (2.  22)  has  a unique  solution  w = |(t)  existing  on  0 < t < « 
and  satisfying  the  following  properties : 

(2.  23)  lim  Kt)  = = <1,^(0):  0 < ^Ct)  - y < (g{0)  - yo)e'*^''  (0  < t < « ) , 

where  4)^  1^  the  solution  of  (1.6)  (see  Th eorem  3 ) and  K ^ some  positive 

constant. 

Moreover,  ^ 4’(t, the  unique  solution  of  (1.1),  (1.  2)  in 
Theorem  1 and  if  4(t/fji)  is  the  unique  solution  of  (2.  20)  for  > 0,  then 
I to  any  ^ 0 there  exists  a constant  K > 0 (independent  of  p.) 

such  that 

(2.24)  l4){t,p)  - Ut/p)I  < Kt  (0<t<tp;p>0) 

The  estimate  (2.  24)  establishes  the  existence  of  a boundary  layer  in  a 

i 

I- 

positive  neighbourhood  of  t = 0.  Theorem  7 is  proved  in  Section  9. 

In  Corollary  3.1  we  showed  that  o-fp)  — as  p -►  0^,  so  that 
the  solutions  4>(t»K)  of  (1*1),  (1.2)  and  4>Q(f)  of  (1.6)  do  not  differ  by 
I much  for  small  p > 0 and  for  large  t.  Our  final  result  makes  this 

L. 


I 


i 

i 
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precise,  under  the  additional  assumptions  that.  ta(t)  < L (0,«>)  and 


r c X Ir'^). 

Theorem  8.  Let  (H  ).  (H„),  (H  ) be  satisfied.  In  addition,  assume  that 

a X ’ 9 

2 + + 

F c C (IR  X IR  ) and  that  (2.  6)  holds.  Then  there  exist  constants  K > 0, 
jip  > 0 and  a function  y c C*[  0, «> ),  y positive,  bounded  and  non- 
decreasinq^  such  that 

(2.25)  • <.p(t)  < <|:(t,p)  < <|,^(t)  + (g(0)  - «t,p(0))exp(-Kt/p)  + Y(t)p  |log  pi 

(0  < t < » ; 0 < p < pp)  . 

In  particular^  as  an'immediate  consequence  of  (2.  25),  there  exists  a 
constant  K > 0 such  that 

(2.  26)  0 < «j>(t,p)  - <^^(1)  = 0(p  |log  pl),(p  — 0^;  Kp  [log  p I < t < w ) . 

Theorem  8 is  proved  in  Section  10. 

The  method  of  proof  of  Theorem  8 uses  the  notions  of  upper  and 
lower  solutions  of  (1.1).  The  necessary  preliminary  material,  which  follows 
the  lines  of  well  known  results  (see  e.g.  [6],  [19])  is  collected  in 
Appendix  B.  The  inequality  (2.  25)  is  established  by  showing  that  the 
solution  (}>Q  of  the  reduced  problem  (1.6)  is  a lower  solution  of  (1.1) 
on  0 < t < « and  by  showing  that 

w(t,p)  = 4>Q(t)  + (g(0)  - 4!jj(0))exp(-Kt/p)  + y(t)p|log  p| 

is  an  upper  solution  for  suitably  chosen  K and  y (i.e.  that 

- pw'(t)  < J a{t- s)  F (w(t),  w(s)  )ds  for  0<t<oo, 

"CO 

where  w(t)  = g(t)  on  -oo<t<0).  Inequality  (2.25)  then  follows  from 
Proposition  2B,  Appendix  B. 
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The  question  crises  whc:lher  the  order  0(|i  |log  |a  I)  In  (2.  26)  Is 
best  possible.  In  tire  linear  cose  it  Is  not;  for  if  F(y,z)  = y - z,  one 
can  establish  the  inequality 

«|>Q(t)  < <Kt,p)  < .tplt)  + (g(0)  - if^{0))exp{-Y.t/ii)  + Y(t)p 

for  0 < t < 00  and  0 < p < by  the  method  of  proof  of  Theorem  8 . 
In  addition,  one  can  compute  afp)  and  in  the  linear  case  by  the 

method  of  Laplace  transforms  and  show  that  o-fp)  - is  precisely 
of  the  order  p.  In  the  general  case,  however,  we  have  been  unable  to 
Improve  the  estimate  (2.  26). 


2.’  Froof  of  Thcoreni  I.  The  classical  Picard  successive  approximations 
(or  the  Banach,  fixed  point  theorem)  applied  to  the  integrated  form  of  (1.1), 

(1.  2)  show  that  for  each  fixed  p > 0 there  is  a unique  local  solution 
«|>(t,p)  existing  and  in  on  some  interval  [0,T]  , T > 0.  To  show 
that  this  solution  can  be  continued  (necessarily  uniquely  in  view  of  the 
assumptions)  to  the  interval  [O,*)^  it  suffices  to  establish  the 
inequalities  (2.1)  on  any  interval  on  which  the  solution  4(t,p)  exists. 

For  then  the  solution  satisfies  ^ priori  upper  and  lower  bounds,  independent 
of  T,  and  hence  can  be  continued  to  the  interval  [0,«>)  by  a standard 
result  ( 17] . 
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To  establish  (2.1)  on  any  Interval  on  which  exists  we 


have  from  (1.1),  (1.  2) 


0 

(3.1)  • -ji<|;’(0,ji)  = / a(-s)r(g(0),g(s))ds  . 

_oo 

Tne  integral  clearly  exists  since  a c l\o,«>)  and  F(g(0),g(s))  is 
bounded  on  (-»  ,0]  by  (H  ) (H  ).  From  (H  ),  a(-s)  > 0 (-oo  < s < 0) 

r Q d 

and  from  (H^),  (H^),  F(g(0),g(s))  > 0 (-«  < s < 0),  with  the  strict 

Inequality  holding  for  large  negative  s;  therefore,  4>'(0,  ji)  < 0.  Since 
^ < c\  one  has  by  continuity  that  <}>‘(t,p.)  < 0 (0  < t < o-),  for  some  a > 0 
We  claim  first  that 


(3.2)  4)(t,|i)>l  (0<t<tt). 

Indeed,  <|)(0,p.)  = g(0)  > 1,  and  by  continuity  (3.2)  holds  at  least  on 

some  Interval  to  the  right  of  t = 0.  Suppose  0 < t^  < o-  is  the  first 
point  at  which  <J.(t^,|x)  ^ 1,  and  1 < (^ft,!!)  < g(0)  (0  < t < t^).  From  (1.1) 
we  have 

0 t 

(3.  3)  -p<|)  (t.,p,)  = / a(t.  - s)F(l,g(s))ds  + / ■*  a(t  - s)F(l,  4(s,|x))ds  . 

-00  0 ^ 

By  (H^),  a(t^  - s)  > 0 (-oo  < s < t^),  and  by  (H^),  (H^),  F(l,  g(s))  < 0 
.on  ("“5,0],  with  the  strict  inequality  holding  near  zero.  Since  •f'  is 
strictly  decreasing  on  (0,t^],  we  also  have  F(l,?}:(s,p))  < 0 (0  < s < t^). 
Therefore  each  integral  in  (3.3)  is  negative  and  4'(t^,p.)  > 0,  which, 
in  view  of  <^'{t,p)  < 0 (0  < t < o),  is  impossible;  this  proves  (3.  2). 
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Wc  next  clolm: 


(3.4)  . 

for  as  long  as  the  solution  exists.  Indeed,  suppose  for  contradiction 
that  o > 0 Is  the  first  point  at  which 

(3.5)  <|)'(o,p)  = 0 and  <})*(t,p)<0  (0  < t < a)  . 

By  the  argument  of  the  preceding  paragraph  we  have 

(3.6)  ’1  < ,J.(o,fi.)  < g(0)  . 

To  prove  (3.4)  we  compute  (})"{Q',p)  from  (1.1)  and  we  shall  obtain  an 
obvious  contradiction  of  (3.  5)  by  showing  that 

(3.7)  l|.•'(Q',^l)  < 0 ; 

this  implies  that  no  such  a > 0 satisfying  (3.5)  exists  and  proves  (3.4). 

Indeed,  differentiating  (1.1)  (justified  by  (H  ),  (H^),  (H  ) - note  that  by 

a r ' g 

(H  ),  a'  c l\o,oo))  one  has 

d 
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We  shall  need  to  consider  tvyo  cases: 


in  case  (i)  a(t)  satisfies  (H  ) with  a(t)  Ac  ^ A > 0,  \ > 0 (i.e. 

d 

a'(t)/a(t)  ^ -X): 

In  case  (11)  a(t)  satisfies  (H  ) with  a(t)  = Ae  , A > 0,  X > 0. 

Case  (1).  Define  a number  -p,  p > 0,  by  the  relation  4)(q',k)  = 9(-P):  -P 
exists  in  view  of  (3.6)  and  (H^).  Since  g may  take  the  constant  value 
<|.(a,n)  on  some  interval  JC(-««^0],  we  define  -P  uniquely  by  taking 
it  to  be  the  right-hand  end  point  in  such  a case.  We  then  have 


-p  0 

(3.11)  1(a)  = / a'(a  - s)F(g(-p), g(s))ds  + / a'(a  - s)F(g(-p) , g(s))ds 

-P 


.00 


+ J a'(a  - s)F(4)(a, p), <|>(s, p))ds 
0 


Since  ^'(a,|Ji)  = 0 wc  also  Iiave  from  (1.1) 

. -p  0 

(3.12)  0=/  a(a  - s)F(g(-p),g(s))ds  4 / a(a  - s)F(q(- p) , Q(s))dG 

-00  -p 

a 

+ J a(a  - s)F(<j>(o,fi),  <f(s,p))ds  . 

0 

We  next  define  the  function  o-  by  the  relation 

and  we  observe  that  the  log  convexity  of  a implies  that  ofs)  is 
negative  and  nonincreasing:  moreover,  since  a'(t)/a(t)  ^ -X,  X > 0,  o(s) 
is  strictly  decreasing,  at  least  on  some  interval  contained  in  (-‘®,a]  . 
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We  rewrite  (3.11)  in  the  equivalent  form 


(3.13) 


1(a)  = J (r(s)h(s)ds  + / cr(s)h(s)ds 

-CO  -p 


where 


h(s)  = 


a(a  - s)F(g(-p),g(s))  (-00  < s < 0) 

a(a  - s)F(«^(a,^i),«^(s,p))  (0  < s < a)  ; 


we  also  write  (3.12)  in  the  equivalent  form 


(3.14) 


0 = /■  h(s)ds  + / h(s)ds  . 
-00  -p 


From  the  definition  of  -p  and  (3.6)  one  has  1<  g(-P)  < g(0).  Therefore, 


(H  ),  (H  ),  (H  ) imply  that 

d 1 Q 

(3.15) 


h(s)  >0  (-M  < s < -p) 


with  strict  inequality  for  largo  negative  s,  and 


(3.16) 


h(s)  <0  (-P  < s < o) 


Combining  (3.13),  (3.14)  yields 


(3.17)  1(a)  = / (v(s)  - a(-p))h(s)ds  4 / {^[s)  - <7(-p))h(s)ds  . 

-00  -P 


.(3.18) 

(3.19) 


«r(s)  > a(-p)  (-00  < s < -P)  , 

«r(s)<cr(-p)  (-p<s<a), 
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with  strict  inequalities  holding  cither  for  s negative  and  large  or  for 


s near  a.  Using  (3.15),  (3.16)  and  (3.18),  (3.19)  in  (3.17)  shows  that 
each  integral  in  (3.17)  is  nonnegative  and  at  least  one  of  them  is  positive. 
This  proves  (3.10)  and  hence  also  (3.7)  and  (3.4).  Tlie  proof  of  the  global 
existence  and  uniqueness  of  the  solution  (p(t,p)  of  (1.1),  (1.  2)  and 
of  property  (2.1)  in  case  (i)  is  then  completed  by  a straightforward 


continuation  argument. 

Case  (il).  The  above  balancing  argument  establishing  (3.7),  and  hence 
(3.4),  cannot  be  used  when  a(t)  = Ae  A > 0,  X > 0,  since  for  this 
case  it  is  readily  established  from  (1.1)  and  (3.9)  that  (["(ajiJL)  = 0 
whenever  <}?'(q',ji)  = 0.  Instead  we  proceed  as  follows. 

Define  the  number  y^,  ^ Yq  g(0),  by  the  equation 
Xs 

J e F(yp,g(s))ds  = 0.  (For  the  proof  of  the  existence  of  a unique  y^ 

. 00 

with  this  property  in  the  general  case  of  a(t),  which  also  applies  here, 
see  the  proof  of  (2.8)  in  Section  5.)  Since  as  in  case  (i)  c|’(0,ij)  < 0, 

a repetition  of  the  argument  (3.1)  - (3.  3)  above  with  a(t)  = Ae  , A > 0, 
X > 0,  and  "1"  replaced  by  y^  In  (3.  2)  and  the  paragraph  following 
(3.  2),  shows  that  q(t,p)>yQ  for  t > 0,  so  long  as  <{''(t,^J.)  < 0.  To 
show  that  4''(t,p)  < 0 for  all  t > 0,  differentiation  of  (1.1)  with 
a(t)  = Ae’^^,  A > 0,  X > 0,  yields  the  equation 


ji<{>”(t,p)  4 4•(t,^l)[X^i  + / e‘^^^‘®Vj«,(t,p),4(s,p))ds]  = 0, 


with 


1 r®  X.C 

«J>(0,ji)  = g(0)  , e F (g(0),  g(s)  )ds  < 0 . 

Since  Fj>0,  X>o,  n>0,  an  elementary  argument  shows  that  4>*{t,n)<0 
so  long  as  the  solution  <}>{t,ji)  exists.  This  completes  the  proof  of  (Z.l') 
In  case  (11). 


To  prove  (2.2)  subtract  the  equations  (1.1)  for  4>j  and  <}>^  and 
apply  the  mean  value  theorem  obtaining 


t 

+ / a(t  - s)F^(<})^(t,fjL),  T(s))(<j)^(s,p)  - <|>  (s,fi))ds  , 

- 00 

where  or(t)  is  between  <}:^(t,p)  and  <})^(t,p)  and  t(s)  is  between 
<|)^(s,p.)  and  <J;2(s,p).  Let  f = l^st  point  for  which 

i.e.  ^ while  <)|(t,  p)  > (^^(t,  p) 

for  -00  < t < t^.  Since  g^(t)  > -<»  < t < 0,  it  is  clear  that 

integrals  exist  since  a c L\o,to)j  F c c\  and 
satisfy  (2.1)  (or  (2.1')).  From  the  definition  of  t and  F < 0 one  has 

V Cr 

-p(4r|(t^, ^i)  - £ 0,  with  the  equality  sign  holding  if  and  only 

if  g,  9,  on  (-00,0].  Since  p>0  this  implies  <'J(t„,  t^)  - (t  ) > 0 

L c i 0 2 U 

for  g^  5^  g^  and  this  is  impossible. 

To  prove  (2.  3)  put  z(t)  = <l;(t,Pj^)  - <|>(t,p^).  Then  from  (1.1),  (1.2) 

one  has  z(0)  = 0 and 

1 1 ° 

z*(0)  = (-  — +—)/  a(-s)r(g(0),g(s))ds  > 0 . 

Hi  :!«> 


i 
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% JL 

+ “ / a(T  - s)F(<}-(T,ji  ),<jr(s,^l  ))ds  . 
^2-00  c-  c 


T 

But  by  (2.1)  / a(T  - s)F(<j:(T,  ji.),  4,(5,  ))ds  >0  (i  = 1,2).  Therefore 
-00  ^ ^ 

that 
1 ^ 

/^a(T  - s){F(4,(T,F2),<t(s,P2))  ' F(4(T,j.^),  4(s,^j))}ds  , 

•which  by  the  definition  of  T and  <^(t,  ji^)  = 4)(t,  fi^)  = g(t)  on  (-«>,0]  yields 
1 r^‘ 

(3.20)  z'(T)  > — J o(T  - s){r(d(T,|i  ),4.(s,fi  ))  - F(4-(T,  ji  J , 4,(5, ))  }ds  . 

jij  0 c c u i 


Applying  the  mean  value  theorem  in  (3.  20)  gives 
1 ^ 

(3.21)  z'(T)  > — / a(T  - s)F  (4>(T,u  ),C(s))«:(s,p  ) - 4(s,HL,))ds  , 

Pi  0 z z z i 

where  ^(s)  is  between  4(5, u^)  and  4-(s,p^).  Since  a(T  - s)  > 0 

and  F < 0 by  assumption  and  since  4(5, p)  - 4(5, p.)  <0  on  0<s<T 

Ct  ^ X 

by  the  definition  of  T,  (3.21)  implies  that  z'(T)  > 0.  Therefore  there 


cannot  exist  a T > 0 such  that  z(T)  = 0;  this  proves  (2.  3),  and  completes 


4.  Proof  of  Theorem  2 find  Corollnry  2.1.  Let  ^ be  the  solution  of  (1.1), 
(1.2).  We  return  to  equation  (3.8)  obtained  by  differentiating  (1.1)  and 
we  write  (3.8)  in  the  form 


I 

\ 


I 


(4.1) 

where 

(4.2) 

(4.3) 


-H4>"(t,H)  = G(t,^x)«t’(t,^l)  + f(t,H)  (0  < t < 00  ) , 


V 

■ s)F^(4.(t,(jL),^(s,ji))ds  , 


-00 


£{t,ji)  = / a'(t  - s)F((}>(t,^l),<))(s,|JL))ds  . 


- 00 


Since  > 0,  a c L (0,co),  a(t)  > 0 (0  < t < =o),  and  since  satisfies 
(2.1),  we  have  by  (Hg  ), 


(4.4) 

where 

(4.6) 


0 < yA  < G(t,  ji)  < rA  (0  < t < 00^  ^ > 0)  , 


r Y = inf  F (y,z),  r = sup  F (y,z),  S = [l,g(0)]  X [l,g(0)] 
' S S 


CO 


A = / a(s)ds  . 

0 

We  next  show  that  there  exists  a constant  K>  0,  independent  of 
ji,  such  that 


00 


(4.6) 


|f(t,|jL)  I < K J a(3)ds  (0  < t < 00,  n > 0)  . 

t 


Write 


f(t,p)  = where 


i 
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Moreover,  the  boundedness  of  <}>  and  g and  the  continuity  of  F imply  that 

0 « 

(4.9)  / a(t  - s)F(<];(t,H.),g(s))ds  < sup  IF(<^(t, p), g(s)) ! / a{s)ds  . 

-eo  0<t<»  t 

-oo<s  < 0 

Combining  (4.7),  (4.8),  (4.9)  shows  the  existence  of  a constant  K, 
independent  of  p,  such  that  satisfies  the  estimate  (4.6).  In  a 

similar  way  one  finds  on  using  the  log  convexity  of  a that 

00 

|l  |<(-a(0))  sup  lF(,f.(t,p),g(s))|  / a{s)ds  . 

^ 0<t<  « t 

-oo<  s <0 

Combining  the  estimates  for  and  establishes  (4.6). 
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Returning  to  (4.1)  we  have 


(4.10)  ^ (t,H)cxp(- / G(s,p)ds))  = exp(-  / G(s,|i)ds) 


By  (4. 6)  one  has,  for  0 < t < « and  p.  > 0, 

■ it  t ' 

(4  11)  Jidaiii  exp{- J G(s,p)ds)  < - exp(- / G(s,p)ds  + log  f 


a(s)ds) 


Before  Integrating  (4.10)  observe  that  by  a,  a'  € L (0,«)  and  by  the  log 
convexity  of  a one  has 


;!(;;/  G(s,^)cis  + log  / o(s)ds)  = 


J a(s)ds 

t 


J a’(s)ds 

= + 0(1)  (0  < t < « H > 0)  . 

H 00  _ J r 

J a(s)ds 

t 

A simple  but  tedious  calculation  then  shows  that  by  use  of  (4.4),  (4.6)  in  (4.11) 
there  exist  constants  > 0,  K = K(p^)  > 0 such  that 


(4.12)  1/  ^^^exp(-/  G(s,p)ds)d|I 


t 

s K exp(-^  f G{s,u)ds  + Iog(/  a(s)ds))  (0  < t < »:  0 < p < p.)  . 
•*0  t " ° 

Then  integrating  (4.10)  and  using  (4.12)  and 


1 U 

■4.'(0,p)  / a(-s)F(g(0),g(s))ds  , 

1*  -00 


1 
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as  well  as  conclusion  (Z.l)  ol  'fhcorcm  1,  yields  the  estimate  (2.5  ), 
where  = \A.  This  completes  the  proof  of  Theorem  2. 

Proof  of  Corollary  2.1.  Integrating  (2.5  ) from  t to  infinity  and  using 
conclusions  (2.1)  and  (2.4)  of  Theorem  1 yields 

^ -K  t/fi  ^ * 

0 < ^^(t,p)  - ttip)  < Ke  + K J (f  a(G)ds)d^  (0  < : 0 < t < «)  : 

t I 

an  integration  by  parts  gives 

-K.t/p  . « 

(4.l3)  0 < <|i(t,p)  - aip)  < Ke  ^ J (s  - t)a(s)ds  (0  < |i  < : 0 < t c «^) 

Since  a(t)  is  log  convex,  log  a(t)  is  bounded  below  by  an  affine 
function;  thus  there  exist  constants  o-  > 0,  p > 0 such  that  a(t)  > ac 
(0  <’t  < ‘o).  Kence  given  any  ® there  exists  a constant  which  wo 

again  coll  > 0 such  that  the  integral  term  in  (4.13)  dominates  the 
first  term  for  tQ<t<oo,  0<ji:s  this,  together  with  (4.13), 

establishes  (2.7)  for  0 < t^  < t < oo  , 0<p:£pQ,  and  one  obtains  the 

estimate  (2.7)  with  t^  = 0 by  an  appropriate  modification  of  K- 
This  completes  the  proof  of  Corollary  2.1. 

5.  Proof  of  Theorem  3 and  Corollary  3.1.  We  observe  first  that  if  4>q 
Isa  (continuous)  solution  of  (1.6)  for  t >:  0,  then  4*q(0)  = ‘I>q(  0^) 
must  satisfy 

(5.1)  0 = J a(-s)F(4.Q(0),  g(s))ds  . 

- 00 
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This  Integral  clearly  exists  for  any  number  4>q{0)  since  a e L (0, oc) 
and  (Hp),  (H^)  hold;  moreover  by  (Hp),  the  integral  In  (5.1)  is  a 
continuous,  strictly  Increasing  function  of  (the  parameter)  <(>^(0).  The 
Integral  is  negative  if  one  chooses  4>q(0)  = 1;  it  is  positive  if  one  chooses 
<>q(0)  = g(0).  Therefore,  there  exists  a unique  number  <j>Q(  0)  = y^  for 
which  the  integral  in  (5.1)  vanishes  and  clearly  1 < y^  < g{0).  This 
proves  (2.8). 

We  postpone  to  the  end  of  this  section  the  proof  of  the  existence 
and  uniqueness  of  the  solution  <{)q  of  (1.  6)  and  of  the  fact  that 
•^0  ‘ ■ 4>q(0^)),  and  we  proceed  to 

establish  the  remaining  conclusions  of  Theorem  3.  If  4’q  * C^IO,  oo), 
then  differentiation  of  (1.6),  justified  by  (Hg).  (Hp)i  (Hg)»  yields 

t t 

(5.2)  0=  / a'(t-s)  F(<t>Q(t),  4>Q(s))ds  + 4.^(t)  / a(t  - s)Fj  ( 4.^(1),  <J)Q(s))ds 

-00 

(0  < t < oo). 

We  shall  establish  conclusions  (2.9)  - (2.11)  by  using  (5.2).  Note  that 
the  coefficient  of  4.^^  in  (5.2)  is  bounded  away  from  zero  by  (Hp). 

A simple  calculation  shows  that  in  the  special  case  a(t)  = Ae  ^ , 
A>0,  >.>0,  we  have  4>'q  (t)  =0  for  t>  0 (from  (5.2)),  so 

that  = Vq  t 2:  0,  and  this  proves  the  remark  below  (2.9). 

If  a(t)  Ae~^^  , A > 0,  X > 0,  we  wish  to  show  that  (2.9)  is 
satisfied.  From  (5.2)  we  first  have 
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(5.3) 


0 


/ a'(-s)l’{y  ,g(s))ds 

-00  ^ 


J a(-E)F^(yQ,g(s))ds 


-00 


By  (Hp),  (H  ) the  denominator  in  (5.3)  is  positive.  To  determine 

a I Q 


the  sign  of  the  numerator  we  note  first  that  by  the  definition  of  y 


(5.4) 


V/ 

/ a{-s)F(y|j,g{s))ds  = 0 , 


.00 


and  by  the  log  convexity  of  a,  repeating  the  argument  in  Theorem  1 which 
shows  that  the  integral  I(a')  > 0 in  (3.10)  (here  we  take  o = 0, 

4'(«'>i‘)  = Yqi  <?(s,M.).=  g(s),  and  define  -p,  p > 0,  by  the  relation 


g(-p)  = y^j),  we  find 


(5.5  ) 


J a'(-s)F(yp,  g(s))ds  > 0 . 


-00 


Therefore,,  by  (5.3) J (5.4),  (5-5),  one  has  <1jq(0)  < 0. 


Since  (j)^  c c\o,<»),  by  continuity  one  has  4>Q(t)  0 < t < a, 

for  some  o-  > 0.  We  claim,  as  in  the  analogous  part  of  the  proof  of  Theorem  1 
(see  the  proof  of  (3.2)),  that 

(5.6)  (0<t<o). 


Interval  lo  llic  riglu  of  t = 0.  Lcttlna  0 < 5.  “ first  point 

at  which  ond  using  > 1 for  0 < I < t^,  we 

substitute  in  (1.6)  with  ^ and  see  that  the  right-hand  side  is 

strictly  negative.  This  contradiction  proves  (5.6). 

We  next  show  that 

( 5-7  ) «j>|j(t)  <0  (0  < t < 00)  . 

Here  the  procedure  differs  somewhat  from  the  analogous  part  of  the  proof 
of  Theorem  1 in  that  we  do  not  need  to  compute  (j)^.  Suppose  t = a is 
the  first  point  at  which 

(5.8)  <|>Jj(ff)  = 0 and  <})^(t)  < 0 (0  < t < a)  . 

Moreover,  by  (5.2)  we  have 
C 

J a'(Q'  - s)F(<|-^{q'),  g(s))ds  + / a'{a  - s)F((j)^(Q'),  (.s^(s))ds 

(5.9)  = 

J a(c  - s)Fj(<{)^(q'),  g(s))ds  + J afo-  - s)F  (<{)p{Q'),  <>Q(s))ds 
-00  0 

The  denominator  in  (5.9  ) is  clearly  positive  by  (H  ) (H  ) (H  ).,  The 

a F g 

balancing  argumeht,  involving  (1.6)  with  t = a and  the  log  convexity  of 
a which  has  been  previously  employed  in  the  proof  of  (3.10)  and  (5.5), 
shows  that 

0 a 

J a'(ff  - s)F(<{>p(Q'),g(s))ds  + J a'(a-  - s)F(<j)^(Q'),  ^^(sDds  > 0 . 

• CO  0 

Therefore  ^ 0|  controdicting  (5.8)  arid  proving  (5.7  ).  This 

completes  the  proof  of  (2.9). 
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The  proof  of  properly  (2J0)  Ic  similar  to  that  of  (2.  2)  in  Tiieorcm  1 
and  is  omitted. 

To  prove  property  (2.11)  we  observe  first  that  since  «}'(0,p)  = g(0) 
and  since  <|>q(0)  = Yq,  (2.11)  is  true  at  t = 0 by  (2.8),  and  therefore, 
by  continuity,  (2-11)  is  true  in  some  interval  of  t > 0.  Suppose  tiiat 
4«(t,p)  - <»o(t)  is  zero  for  the  first  time  at  t = t^  > 0 and  4)(t,p)  - 4>p(t)  > 0 
(0  < t < tjj).  Then  from  (1.1),  (1.6)  at  t = t^^,  one  obtains  by  subtraction 

‘o 

(5.10)  =/■  a(tj  - s)(F((,(t||,H),(.(s,^))  - F((,^(tj,).(,|j(s))]ds  . 

Applying  the  mean  value  theorem  to  the  difference  under  the  integral 
(note  that  (j5(tjj,ii)  = W iho  definition  of  t^)  and  using  F^(y,z)  < 0 

and  <{>(s,p)  - «j>Q(s)  >0  (0  < s < t^),  it  is  evident  that  the  right-hand 
side  of  (5.10)  is  negative.  This  implies  that  <}>'(tQ,p)  > 0 which 
contradicts  (2.1)  and  proves  (2.11). 

It  remains  to  prove  the  existence,  uniqueness  and  differentiability 
of  the  solution  «j>p  of  (1.6)  for  0 s t < «.  Let  0<e<t<  T<oo 
and  let  (Pn3n  = 0 arbitrary  sequence  with  , lini  Pn~ 

Oft  ri  00 

Consider  the  sequence  solutions  of  the  initial  value  problem 

(1.1),  (1.2)  on  e < t < T.  By  Theorem  1,  1 < <j)(t,  (i^)  < g(0),  and 

♦(tfPn+jX  «l>(f.Pjj).  e :<T,  n = 0, 1,  ...;  therefore,  z(t)  = lim  4)(t,p^), 

n — ► 00 

E St  s T,  exists.  By  the  estimate  (2.5)  of  Theorem  2 the  sequence 

00 

f Is  equicontinuous  on  the  interv/al  € < t < T for 

**  n=  0 — — 
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I 

I 

1 


0 < Hjj  £ Ascoli-Arzela  theorem  and  the  fact  that  the 

sequence  is  already  known  to  converge  pointwise  to  2(t),  we  see  that  the 
convergence  to  z(t)  is  uniform  for  c<t<T,  c > 0 being  arbitrary,  and 
so  z(t)  is  continuous  for  t > 0.  Passing  to  the  limit  as  n — <»  in 
the  equation  (1. 1)  for  <})(t,  we  see,  again  using  the  estimate  (2.  5) 

and  Lebesgue's  convergence  theorem,  that  z(t)  satisfies  (1.6)  for  t > 0. 

Since 

(5. 11)  <|.(t^,  p^)  > pj  if  0 < < tz  > 

we  see  in  the  limit  as  n — «»  that  z(t)  is  a nonincreasing  function  of 
t and  so  2(0+)  exists.  If  we  define  z(0)  = z(0+),  then  clearly  z(t) 
satisfies  (1.6)  for  t = 0 as  well  as  for  t > 0. 

Identifying  z with  the  required  solution  4>q,  we  can  show  4)^ 
to  be  continuously  differentiable  by  forming  difference  quotients  in  (1.  6), 
applying  the  mean  value  theorem  and  showing  that  we  can  pass  to  the  limit 
to  prove  both  the  existence  of  (j>^  and  the  equation  (5.2). 

To  prove  uniqueness  let  u,  vc  C[0,  T],  T>0  arbitrary, 
u{0)  = v(0)  = y^  , u(t)  = v(t)  = g(t)  on  - 00  < t < 0,  be  two  continuous 
solutions  of  (1.6)  on  [0,T]  with  u(t),  v(t)  > 1 on  [0,T].  Then 
by  (Hp)  and  the  mean  value  theorem  we  have 


0 

(5.12)  0=  (u(t)- v(t))[  J a(t-s)  Fj(tj(t,  s),g(s))  ds 


for  some  ^{t,s),  |2(t,s),  ^(t,s)  between  u(t)  and  v(t),  and  for  some 
Tij(^s),  112(1*  s)  between  u(s)  and  v(s).  But  then  ac  L^{0,ooh  the 
continuity  of  F^,  F2,  > 0,  and  the  cx)ntinuity  of  u,  v on  [o,  T], 

together  with  Gronwall’s  inequality  applied  to  (5.12),  imply  that 
u(t)  H v(t),  OstsT,  where  T>  0 is  arbitrary,  establishing  uniqueness 
of  a continuous  solution  <j>Q  of  (1.6)  on  [ 0, «)  with  <})Q(t)  > 1 , 

0 s t < 00.  This  completes  the  proof  of  Theorem  3. 

Proof  of  Corollary  3.1.  To  establish  (2.13)  note  first  that  by  Corollary 
2.1^  <l>(T,ii) -*a(^)  as  T — 00  uniformly  in  |j.  for  0<p.  Let 

Ti>  0 be  given.  Using  (2.4),  (2.11),  (2.12)  choose  T>0  so  large  that 
l“(p) -'l>{t.p)l  < y and  0<  .(.^(t)  - a^<  for  t>T,  0<p<Pq.  By 
the  convergence  of  <}>(t,  p)  to  4>Q(t)  as  p— 0^  on  0 < t < 00  choose 
0<Pq£Pp  sufficiently  small  that  0 < <t> (T, p ) - <})p(T)  < 0<p£pQ. 

Then 

0 <o(p)-a^  s 1 a(p) -4>(T,p)l  + (4.(T,p)- «|>q(T))  + (c!>q(T) - a^)  < , 

for  0 < p < p^,  proving  (2. 13). 

The  first  inequality  in  (2.14)  follows  from  (2.11).  To  prove  the 
second  Inequality  in  (2.14)  note  that 


.t.(t,p)  - yt)  = 4.(t,p)  - a(p)  +a(p)  - yt) 

< «j.(t,p)  - a(p)  + a(p)  - . 
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where  the  last  step  follows  by  Theorem  3 ; (2.14)  now  follows  by 
estimating  4)(t,p)  - ct(p,)  by  (2*7).  This  completes  the  proof  of 
Corollary  3.1. 


6.  Proof  of  Theorem  4 . In  view  of  the  monotonicity  property  (2.10)  of 
solutions  of  (1.6)  with  respect  to  the  function  g,  it  suffices  to  prove  the 
result  for  the  function  g given  by 


(6.1) 


g(t)  = 


1+6  If  -r\^t  :<  0 

< 

1 if  t < -11  , 6 > 0,  Ti>  0 . 


(Strictly  speaking,  this  function  g does  not  satisfy  the  hypothesis  (H  ), 

g 

being  discontinuous,  but  it  is  readily  verified  that  the  proof  below  would  be 
essentially  unaltered  if  the  given  g were  replaced  by  a continuous  g 
approximating  sufficiently  closely  to  it.)  Since  1 < g(t)  < 1 + 6 , -oc<ts  0, 
property  (2.9)  implies  that  1 ^ 4>Q(t)  ^1  + 6,  0:st<cio.  This  means 
that  the  arguments  of  F in  (1.6)  are  close  to  1,  if  6>0  is 
sufficiently  small  which  will  be  the  case  in  what  follows.  For  this  reason 
and  in  order  to  simplify  the  calculations  we  assume,  consistent  with  (Hp  ) 
and  without  loss  of  generality,  that 


(6.2)  Fj(l,l)  = 1,  F^d,!)  = -1  ; 

note  that  r(x,x)  = 0 (x  > 0)  implies  that  F^{x,x)  = -F^(x,x). 
Substituting  (6.1)  into  (1.6)  yields  (note  a c L^(0,oo)) 


Using  (6.2),  the  mean  value  theorem,  and  F € ( IR  x R ) yields 

0 

r 

■h 


f %(t)  - 1)/  a(t  - s)ds  + (4.Q(t)  - 1 - 6)  / o(t  - s)dj 


(6.4)  < 


0 t 

+ 0(6)  J a(t  - s)ds  + / a(t  - s)(<}>  (t)  - <|)  (s))ds 

/\  U 0 


..00 


+ / a(t  - s)[o((j.Q(t)  - 4.^(s))]ds  = 0 , 


1 ‘ ° 
where  (by  a c L (O,*®),  a(t)  > 0)  the  terminology  w(t)  = o(6)  f a(t  - s)ds 


00 


means  that  for  ever^’  c>0  one  has  lv/(t)i<  csj  a(9)d|  for  t>0 

t 

and  for  6 > 0 sufficiently  small.  An  equivalent  form  of  (6.4)  is 

« t t+q 

(6.5)  {«|.  (t)  - 1)  / afe)d|  - / a(t  - s)(<t  (.s)  - l)ds  = 6 / a(e)de 

0 0 t 

00  t 

+ 0(6)  / a(e)de  + / a(t  - s)[o«>  (t)  - <))  (s))]ds  . 
t 0 u u 

Since  1 < ^Q(i)  <1+6  on  0<t<oo,  (H^)  implies  that  the 

first  term  in  (6.  3)  is  positive,  while  the  second  and  third  terms  are 

negative.  Thus  the  third  term  in  (6.  3)  must  be  in  modulus  less  than  the 

first  term.  Tliis  in  turn  implies  that  the  last  term  in  (6.  5)  (or  the  last 

00 

integral  in  (6.4))  must  be  o(6)  J a(4)dC>  Therefore,  putting  z(t)  = ^ 

t ° 

in  (6.  5)  yields  the  equivalent  equation 


00 


(6,  6)  z{t)A  - a * 2(t)  = 6 / a(s)ds  + o(6  J ' a(s)ds)  (0  < t < «>)  , 


00 


where  ♦ denotes  the  convolution  and  A = f a(s)ds. 

0 
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■We  shnll  iiov.^  Cipply  a simple  Tauboriun  thoorem  lor  (real)  I.aplaco 
translorins  [20;  Tneorem  4.3,  p.  192]  to  soluUoiis  of  (6.6).  Put 


(6.7) 


4<(t)  .=  / a(s)ds,  u)(t)  = j a(s)ds  , 

t t 


(6.8) 


z(p)  = J e ^*’z(t)dt  (p  > 0)  . 

0 


Since  z is  bounded  and  continuous  on  [0,°°),  end  since  a « L (0,«')^ 
as  well  as  4<{p)  and  w(p)  exist  for  p > 0.  Noting  that  A = a(0)  and 
that  multiplication  of  (6.6)  by  e’*^^  and  integration  preserves  the  o- 
relation  when  p is  real,  we  obtain  on  solving  for  z(p) 


(6.9) 


z(p)  =. 


a(0)  - a(p) 


(P  > 0)  . 


By  as.sumptions  (H  ),  (2.6  ),  and  l.ebesgue's  dominated  convergence 

d 

theorem,  and  integration  by  parts  we  also  have 


(6.10) 


<»  (0)  = J J a{s)dsdt  = f sa(s)ds  , 
0 t 0 


(6.11)  4^(0)  =/  J a(s)dsdt</  / a(s)dsdt  = / sa(s)ds  . 

0 t 0 t 0 

Moreover,  by  Fubini's  theorem  and  (2.6  ) 


(6.12) 


>(P)  = • / te  '^*a(t)dt  (p  > 0)  , 


so  that  by  the  mean  value  theorem  and  (2.6  ) , 


(6.13) 


(0)  - a(p)  ~ P / ta(t)dt  (p  - O"*^)  . 
0 


N-  > 


urii.r  ft.l]),  (t.l:),  ls-.‘.inc  ? - In  (t.9),  nn-J  i,-.-... 

I > 0 r.uJficic'nl]/  r.r.-.n!],  Ihcre  oxisic  6 constr-.nl  K{0)  > 0 iwcr,  lh,c'. 


(6.14)  2(p)  - (P  ^ 0^)  , 

P 

The  above-mentioned  Tauberian  theorem,  together  with  the  fact  that 
lim  z(t)  exist,  then  implies 

t-Ki 

z{i)  - K(6)  >0  (t  - -i  ‘■=)  , 


c.fiz  by  '.he  cefinition  of  z (6.15)  yields 

4-p(t)  ~ 1 4 K(6)  >1  (t  --  -f  CO)  ^ 


for  6 > 0 sufficieritly  Emell.  This  completes  the  proof  of  Theorem  4. 


jL‘  .CL9PL^LZl!P-9.LPj]lA'  Using  (2.16)^  eqnolion  (1.6)  ir.ov  be  wiitten  in 
tTie  form 


0 

(7.1)  Ay(t)  - n y(t)  = J e(l  - £)g(s)ds  (f>  < t < <^')  , 

- 00 

CO  t 

where  A = J a{5)cs  end  e - y(t)  = f a(t  - s)y(s)ds.  Letting,  as  in 
0 0 

trje  proof  of  Theorem  4,  y(t)  ^ z(t)  - 1, 

(7.1)  becomes  the  linear  Vol'.erre  ecoetion 

(7.2)  Az(t)  - a 4.  z(t)  ^ G(t)  (0  < I < c^)  , 

where 

0 

(7.3)  G(t)  = / a(t.-  s)(g(s)  - l)ds  (C  < l<  oo)  . 

- 00 
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We  proceed  as  in  the  proof  of  Theorem  4,  but  taking  any  g satisfying  (H  ), 
rather  than  the  special  g of  (6.1).  Taking  the  Laplace  transform  and 
noting  that  A = a(0)  we  obtain 


(7.4) 


z(p)  = X 


a(0)  - a(p) 


By  Theorem  3 lim  z(t)  = z exists  and  z^  > 0.  If  z >0,  a standard 

t-*  CO 

Abelian  theorem  for  Laplace  transforms  [20,  Cor.  lb,  p.  182]  states  that 
^ ^00  + 

z(p)  " “ (P  — 0 ).  We  shall  apply  this  result  to  (7.4)  for  p real. 

Since  by  (H^)  g(t)  > 1 (-co  < t < 0),  and  since  g is  nondecreasing 
with  g(t)  > 1 near  t = 0,  we  have  by  Fubini's  theorem 


(7.5) 


^ 00  0 

0 < G(p)  - J e f a(t  - s)(g(s)  - l)dsdt 

0 r 00 

0 00 

= J (9(s)  - 1)  / e '^^a(t  - s)dtds  (p  > 0) 
-00  0 


Let  c > 0 be  given:  choose  a number  N = N(c)  > 0,  and  using  (H  ) 
divide  the  interval  (-°®,0]  into  two  parts,  such  that 
(7.6)  g(s)  - 1 < e (s  < -N)  . 

Then  (7.6)  used  in  (7.  5)  yields 


0 < G(p)  < c / J e'^®a(0)dCds  + 0(1)  (p  - o"^)  , 

-00  -s 
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or  equivalently 


(7.7)  0<G(p)<c/  e'^°a(0)d8da  + 0(1)  (p-‘0^. 

0 cr 


By  an  integration  by  parts  and  a € L (0,°^)  we  also  have 

00  CO 

(7.8)  a(0)  - a(p)  = / (1  - e’^Sa(t)dt  = J (1  - e'^Se’^^e ■^^a(t)dt 

0 0 

00  00 

~ P J / e *^^a(0)d0dt  >0  (p  > 0)  . 

0 t 

Therefore,  (7.7),  (7.8)  substituted  into  (7.4)  yields 

00  00 

c J e^^  f e ^^a(0)d0dt  + 0(1) 

(7.9)  ■ ° .o  (P-0*). 

p J e^*  J e *^^a(0)d0dt 

0 t 


But  by  (2.17  ) 

00  00 

lim  J e*^^  J e ^^a(0)d0dt  = + oo  ^ 

p-*0  0 t 

and  this  fact  used  in  (7.9)  shows  that  for  any  c > 0 

z(P)  ^ p (P  * 

Since  c > 0 is  arbitrary,  z^>  0 is  impossible.  This  completes  the 
proof  of  Theorem  5. 
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8.  Proof  of  Thcurcni  6.  Let  <})q  bo  the  unique  solution  of  (1.6)  (see 
Theorem  3).  Thus  «l>p  sotisfies 

0 t 

(8.1) /  Q(t  - s)F(<j.  (t),g(s))ds + / a{t  - s)F(«j)p(t),  <})  (s))ds  = 0 (0  < t < . 

-CO  0 

By  a t (Hp),  (H^),  and  the  boundedness  of  4:^  tlierc  exists 

a constant  K > 0 such  that 

0 . ®o 

(8.2)  ®</  a(t  - s)F(<{)  (t),  g(s))ds  < K / a{s)ds  (0  < t < «)  . 

-w  t 

(The  first  inequality  in  (8.  2)  follows  by  simple  consideration  of  signs  in  (S.l).) 
Applying  (Hp)  and  the  mean  value  theorem  we  have 

= r'j%(s),4o(s))(<ijj(t)  - <>o(s))  + ofe^^ft)  - 4^(s))  , 

which  by  the  boundedness  of  4^,  F^  > 0,  and  the  continuity  of  F^ 
implies  that  there  exist  constants  M,,  M >0  such  that 

r 2 

(8.  3)  M^{4p(s)  - 4p(t))  < -Ffd^ft),  4q(s))  < M^(4q(s)  - 4^(1))  ( 0 < s < t < cc). 

Using  (8.  2),  (8. 3)  in  (8. 1)  shows  that 

t 

(8.4)  j a(t  - s)(4>  (t)  - 4^(s))ds  = 4(t)  (0  < t < w) 

0 


where 


(8.5) 


00 

= 0(/  a(s)ds)  . 

t 
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Let 


(8.6) 


2(t)  = - n-Q  (o-Q  = lim  <.p{t)) 

00 


Then,  from  (8.4),  z satisfies  the  linear  Voltcrra  equation 


(8.7) 


t t 

2(t)  f a(s)ds  - J a(t  - s)z(s)ds  = 4)(t)  (0  < t < «>) 

0 0 


Witing  J a(s)ds  = / a(s)ds  - J a(s)ds  and  noting  that  z(t)  is 
•0  0 t 

bounded  shows  that  (8.  7)  may  be  written  in  the  form 


(8.8) 


z|t)A  - a * z(t)  = w{t)  (0  < t < “0;  A = J a(s)ds  > 0) 

0 

00  00 

w(t)  = 4j(t)  + z(t)  J a{s)ds  = OlJ  a(s)ds)  ( 0 < t < «). 

t t 


We  now  solve  (8.8)  by  Laplace  transforms.  By  the  argument  of 
00 

Theorem  4 and  using  f ta(t)dt  < w,  we  find  that  if 

0 

00 

w(0)  = f w(t)dt  # 0 
0 


(note  that  this  integral  exists  in  viev/  of  the  estimate  satisfied  by  w 
and  ta(t)  t l\o,«>)),  then  lim  z{t)  - z r 0.  But,  by  Theorems, 


t-  00 


2 “0.  Tlierefore, 

00  > 


(8.9) 


J w(t)dt  = 0 . 
0 
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Now  intcgrolc  (8.8)  over  [0,T) , T > 0,,  oblainliiy 

T T T 

A f z(s)ds  - / a z(t)dt  = f w(t)dt  . 
0 0 0 

But,  by  (8.9),  the  last  equation  can  be  written  as 


(8.10) 


T T 

A f z(s)ds  - f a * z(t)dt  - - J w(t)dt 
0 0 T 


Interchanging  the  order  of  integration  in  the  double  integral  on  the  left- 
hand  side  of  (8.10)  and  using  the  estimate  for  w in  (8.8),  as  well  as 
ta(t)  € L^(0,«>),  yields 


00 


T T T-s 

(8.11)  a/  z(s)ds  - /•z(s)/  a(cr)dffds  = 0(/  (t  - T)a(t)dt)  (0  < T < «)  . 
0 0 0 T 

Using  the  definition  of  A and  combining  the  two  integrals  on'  the  left-hand 

side  of  (8 . 11)  yields 


00 


(8.12) 


X 00 

J z(s)  J a(ff)dods  - 0[J  (t  - T)a(t)dt)  . 
0 T-s  T 


Finally,  using  the  fact  that  z is  decreasing  on  0 < T < «,  we  obtain 
from  (8.12) 

XX 

z(T)  / / a(a)d(rds  = O^/  (t  - T)a(t)dt)), 

0 T-s  T 


which  on  Interchanging  the  order  of  Integration  yields 
(8.13) 

This  implies  (2.19)  and  completes  the  proof  of  Theorem  6. 


X eo 

z(T)  J aa(a)d(r  = 0(/  (t  - T)a(t)dt) . 
0 T 
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9.  Proof  of  ThoorcMn  7.  Dy  stondoid  results  lljo  Initial  value  problem  (2.22) 
has  a unique  local  solution  w *-  4(0  having 

0 

(9.1)  -i'(0}  = f a(-s)F(g(0),g(s))ds  > 0 . 

-CO 

Thus  4 decreases  initially.  To  continue  the  Iccol  solution  we  proceed  simi- 
larly to  the  proof  of  Theorem  1.  First,  we  show  that  for  as  long  as 
4'(t)  < 0 one  has 

(9.2)  4(-r)>yp, 

where  y^  is  (see  Theorem  3 ) the  unique  value  for  which 

0 

(9.3)  J a(-s)F(yp,g(s))ds  = 0 . 

-00 

We  observe  that  equation  (2.  22)  may  be  regarded  as  an  autonomous 

ordinary  differential  equation  having  tlie  point  as  its  only  critical  point. 

Recall  from  Theorem  3 that  (4(0)  =)g{0)  > y^.  If  4 assumes  the  value 

at  T = T, , then  by  (9.3) 

^0  1 

0 

-4-(t  ) = / a(-s)F(y  g(s))ds  = 0 , 

■*  -eo 

which  is  impossible  if  4'(‘r2)  < 0*  This  proves  (9.  2).  On  the  other  hand, 

4*(0)  < 0 implies. by  continuity  that  4'(''')  < 0 0 < t < or,  a > 0.  If 

a > 0 is  the  first  point  at  which  i'(a)  = 0,  then 

0 

0 = -4'(o’)  =/  a(’-s)F(4(Q'),g(s))ds  . 

.00 
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This  by  uniqueness  of  Implies  that  = y^,  and  therefore 

w = ^(t)  nnd  w s y^  would  be  two  solutions  of  (2.  22)  through  the 
point  (®,yQ)|  contradicting  uniqueness.  Therefore, 

(9.4)  t'(T)<0, 

for  as  long  as  the  solution  exists.  Now  (9.2),  (9.4)  and  a standard 
continuation  argument  yield  the  global  existence  and  uniqueness  of  the 
solution  w = i(r)  of  (2.22)  such  that 

(9.5)  t'(T)  < 0,  y^  < ^(t)  < g{0)  (0  < t < »)  , 

which  implies  that-  lim  i{-r)  = t exists  and  4 > y . 

^00  ^00  — ^0 

To  prove  (2.  23)  v.'e  combine  (2.  22)  and  (9.  3)  obtaining 
df  ° 

(9-^)  a(-s)(f’(4(T),g(s))  - F{yp,g(s))ds  (0  < t < «>)  . 

• 00 

Applying  the  mean  value  theorem  and  (Hp.)  yields  the  existence  of  a G(t,s), 
0 < 0(t^s)<  1,  such  that 

dP  ° 

<9- 7)  * ^ = / a(-s)rj(yQ-He(r,sX4(T)  - yQ),g(s))ds[  4(t)  - Vq]  . 

•-00 

00 

Let  S = [y  g{0)3  X ll,g(0)],  A = J a(s)ds  > 0.  By  (H  ) 

0 

•y  = inf  F (y,z)  > 0 . 

(y,z)cS 

Therefore  using  this  and  (9.  5)  in  (9. 7)  gives  the  differential  inequality 
(9.8)  » ^ > yA(4(t)  - y^)  (0<T<«). 
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Inlcgroting  (9.8)  ond  using  ihc  inlUul  condilion  t,{0)  = g(0)  yields 

the  second  statement  in  (2.  23),  which  also  Implies  the  first  statement  In 
(2.23). 

Applying  the  transformation  (2.  21)  it  is  clear  tliot  y(t)  = ^(t/p)  is 
the  unique  solution  of  the  initial  value  problem  (2.  20)  for  p > 0,  0 < t < 
Thus 

= / a(-s)r(^(-),  g(s))ds  (p  > 0 0 < t < «>) 

.00  ^ 

e(0)  = 9(0)  . 

I.et  <{>(t,p)  be  the  unique  solution  of  (1 . 1),  (1.  2)  on  0 < t < «,  p > 0 
(see  Theorem  1),  which  can  be  written  in  the  form 

0 

(9.10)  -p^<^(t,p)=/  a(-s)F(<;;(t,p),g(s))ds 

-00 

0 

+ t / a'(-s  + e(t,  s)t)F{4'(t  ,p),  g(s))ds 
- 00 

t 

+ / a(t  - s)F(<{)(t  ,p),<{>(s,p))ds  (0  < t < «,  P >■  0)  , 

0 

where  0 < 0(t,s)  < 1 comes  from  the  application  of  the  mean  value 
theorem  to  a(t  - s)  - a(-s).  Not  e that  since  F(<;)(t  , p),  g(s))  is  bounded 
and  a' t l\o,oo),  the  second  integral  in  (9.10)  clearly  exists.  Subtracting 
(9.9)  from  (9.10),  \ising  the  mean  value  theorem  once  more,  and 
observing  that  the  last  two  terms  in  (9.10)  are  0(t ) as  t — 0^, 
uniformly  in  p > 0,  yields 


(9.11) 


- t(^)]  = 5I(t,»x)[«l.(t,fi)  - uh]  ^ 0(t) 

(t  — 0^,  uniformly  in  p.  > 0)  , 


where 


u 

(9.12)  = / a(-s)F^(U-)+e(t.s,^x)(4.(t,^l)  - e(-)),g(s))ds  , 


• 00 


00 


0 < e{t,s,|i)<  1 . Letting  S = [ 1, g(0)]  X [ 1, g(0)]  , A = J a(s)ds,  and 

0 

using  (H  ),  (H  ) (H  ) implies 
a ’ f g 


(9.13) 


inf  5l(t,p)  > yA  > 0 , 

p>0 


where  y = inf  F {y,z)  is  independent  cf  jjl.  Using  (9.13)  in  (9.11), 

S ^ 

Integrating  (9.11),  and  applying  the  initial  condition  <J)(0,p)  =§(0)  = g(0) 
yields  the  existence  of  a constant  K = l^ftp),  independent  of  p,  such  that 

t - ^ (t-s) 

l<|>(t,p)  - U-)I  < k/  -e  ^ ds  (0<t<tp:p>0) 


• 0 


yA 


which  is  (2.24)  with  K=  K/yA.  This  completes  the  proof  of  Theorem  7 
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^0.  Pioof  of  'ritcorcm  8.  Wc  show  that 

(10.1)  w(t,ii)  = <|»p(t)  + (g(0)  - <|>p(0))exp(-  Kt^) +Y(t)|i  [log  p | , 0<t<<», 

where  <{.p  is  the  solution  of  (1.6),  Is  an  upper  solution  of  (1.1)  for 
0 < t < w for  suitable  choices  of  the  constant  K > 0 and  the  function  y. 
By  Appendix  B It  suffices  to  show  that  w defined  by  (10.1)  for  0 < t < « 
and  w(t)  = g(i)  (-w  < t < 0)  satisfies  the  integrodiffcrential  inequality 

t 

(10.2)  -pw'(t,p)  <f  a(t  - s)F(w(t,p),w(s,p))ds  (0  < t < w)  . 

-00 

We  begin  by  defining  y K in  (10.1).  For  reasons  which 

will  become  apparent  below  let 

t » 

00' 3)  = Yq  exp(Kj  J {f  a(T)dT)dcr)  (0  < t < ») 

0 cr 

where  Y->  0,  K,  > 0 are  constants  specified  below.  In  view  of  (2.6), 

° ^ BK 

Y(oo)  exists  with  Y(oo)=YQe  \ where 

B = I T a(  T)  dT  . 

•'o 

Thus  the  function  w defined  by  (10. 1)  satisfies  the  inequality 

(10.4)  lsw(t,p)  < g(0)  + Y (oc)  P |log  p I (0<t<oo). 

Let  J denote  the  closed  interval  [1,  2g(0)]  and  let  R denote  the 
rectangle  JXJ.  Let  M>  0 be  a constant  such  that 

(10.5)  sup  (|F(y,  z)\,  |F.(y,  2)1,  |f  (y,  2)1,  lF.,(y,  z)\,  |f,  (y,  2)1,  |F^^(y,  2)1) 
(y,z)€R 


s M 
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Tlion  for  nny  choice  of  Indcpencloiit  of  ji,  one  has  for  p. 

sufficiently  small  tiiat  the  ijoints 

(w|t,ji),\v{s,ii)),  (w(t,}l),<^^(s)),  {c!.Q(t),vv(s,|i)),  «Q(t),<}<y(s))  c R 

for  0 < s < t < w : this  statement  is  also  true  for  t < 0 since 
w(t,ji)  = = g(t)  for  t < 0.  Therefore  the  values  of  F and  its 

first  and  second  partial  derivatives  at  these  points  arc  by  (10.  6)  bounded 

by  M. 

1 

We  define  K in  (10. 1)  by  K = ~ mA,  where  A = / a(s)ds. 

^ 0 

With  these  definitions  of  K and  y it  remains  to  verify  (10.2). 
We  begin  by  doing  this  on  the  interval  0 < t < — p |log  pi.  To  simplify 
the  exposition  let  RHS  denote  the  integral  on  the  right-hand  side  of  (10.2) 
for  t on  any  interval  under  consideration.  We  shall  also  suppress  the 


parameter  p in  w(t,p)  when  no  confusion  arises.  By  the  mean  value 

2 

theorem  we  have  for  0 < t < — p [log  p I : 

t 

(10.7)  RHS  = / a(t  - s)F((w  - + ,{>^(1),  (w  - 4>q)(s)  + «j.Q(s))ds 

• 00 

= (w-4>Q)(t)  f a(t-s)Fj(e(t,  s),  Ti(t.  s))ds + 0(p  |log  pi). 


where  we  have  used  the  facts  that  <})q  satisfies  (1.6)  and  that 


1 
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and  the  point  {i(t,  s),  ^{t,  s))  lies  in  R.  Thus  (10.1),  (10 . 6)  used  in  (10. 7) 
yields 


(10.8)  RHS  > in.2v{(g(0)-<})Q(0))e"’''^^  + Y(t)^l  1 log  p | } + O (p  1 log  p ] . ) . 

We  can  clearly  choose  in  (10.3)  sufficiently  large  that  the  term 
mAY(t)p|log  p|  in  (10.8)  is  at  least  twice  the  O-term  if  p is  sufficiently 
small,  and  we  leave  arbitrary  at  the  moment.  Then  (10 . 8)  certainly  yields 

(10.9)  RHS  > m.^{(g(0)  - (}.^(0))e'’^^^''  + j llog  p I ) . 

On  the  other  hand,  it  follows  from  (10.1)  and  (10.3)  using  y'^t)  > 0 (0  < t < ») 
tliat 

(10.10)  -pw'd)  < -p<;V^(t)  4 K(g{0)  - c;:^(0))e'^^^^  (t  > 0)  , 

and  so  from  the  choice  of  K that  for  p sufficiently  small  the  desired 

inequality  (10.2)  holds  on  the  interval  0<t  < p | log  p | . 


We  next  verify  (10.  2)  on  the  interval  — p Ilog  p I < t < K where  K 

K Z Z 


is  a positive  constant,  independent  of  |i,  to  be  determined.  Observe 


2 

that  for  t>— pllogpl, 
K 


(10.11) 


(w  - <J)Q)(t)  < Y{t)p  Ilog  p 1 4 O(p^)  . 


TTivis  by  the  mean  value  theorem  and  the  fact  that  w(t)  = «}.jj(t)  - g(t)  (-«>  < I < C 


one  has 
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V 

(10.12)  RHS  = (w  - <!>  )(t)  / a(t  - s)F  (e(t,  s),  -nCt.  s))  ds 

^ ^00 

2ji  Ilog  1/K 

+ f .a(t  - s)(w  - ^ 

*0 

t 

4/  0(t  ' s)(w  - (|;  )(s)F  (^(t,  s),ri(t,  s))ds  (t> -jr  iillog  n|  ), 

2ji  Ilog  (ji  l/K 


where  the  point  (^(t,s),Ti  (t,s) ) lies  in  R . Let  denote  the 

first,  second,  and  third  integrals  in  (10.12)  . Then  by  (10.1),  (10.  5),  ond 
Y nondecreasing  one  has 

(10.13)  ll^  I 5.  ^ Ilog  ji  p(g(0)  ' 4>p(0))  , 

and  similarly 

2 t 

(10. 14)  ll  < M[  Y(t)p  Ilog  p I + 0(p  )1  / a(t  - s)ds  . 

2p|log  p I /K 

Note  that  I,,  I are  each  negative  while  I is  positive, arid  for  p > 0 
sufficiently  small 


(10.15) 


Ij  > *1  Y(t)p  Ilog  p I . 


Hence  if  is  chosen  Independent  of  p and  sufficiently  small,  it 
follows  from  (10.12)  - (10.15)  that  for  p sufficiently  small 


(10.16) 


RHS  > ~ m^Y{t)^i!log  pi  (.  < t s ) 

^ K 
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From  (10.1),  (10.3)  one  has  again  (lO.lO)  valid  on  the  interval  being 
considered.  Comparing  (10. 10)  and  (10.16)  one  finds  that  (10*2)  is 
satisfied  on  the  Interval  p.  llog  pi  :s  t < K^,  with  chosen  as 
above,  if  p>  0 Is  sufficiently  small.  Note  that  in  this  and  the  previous 
part  of  the  proof  we  have  not  used  the  special  form  of  Y(t)  in  any 
significant  way. 

Finally,  we  establish  (10.2)  for  any  t>  For  any  such  t 
(10.12)  is  valid  and  the  estimate  for  given  by  (10.13)  holds. 

In  fact,  (10.13)  can  be  strengthened  to  give 


and  since  a’/a  is  bounded,  we  have 


2p|log 


where 


2p  I log  p I 
K 


< T < t , 


and  so 


a(t-  = a(t)  (1+  0(p  llog  p|  ))  . 

1C 


Hence,  for  \l  sufficiently  smalls 


I s ^ a(t)  Y ( llog  ji| 


(10.17) 


+ Y ^^Cg(o)  - ^q(o)) 


Further,  since  a '/a  is  bounded,  it  follows  on  Integration  that 


(10.18) 


a (t)  / f a ( T ) d T 


Is  bounded. 


We  can  write 


= (w-V(t)  {J 


Zullog  nl  / K 


+ I 


2n  I log  I /K 


a(t-s)  Fj(|{t,  s),Ti(t,  s))  ds 


= + 1,2  , say 


Certainly, 


Ij  > m Y (t)  I log  (X I J a (t  - s)  ds 


= m Y (t)  n Ilog  I f a(T)dT  , 

t 


so  that,  if  ^ is  sufficiently  small,  we  can  use  (10.17)  and  (10.18) 


to  obtain 


(10.19) 


1 ^ 

^ 2 ^ H I log  (JL I J a(T)dT  . 


I 
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But,  from  (10. 10)  and  Theorem  2 (or  more  precisely  Remark  3.1), 
the  left-hand  side  of  (10.2)  does  not  exceed 


(10.20) 


pK  /"a(T)dT  + K(g(0)  - «}.q(0))  * 


We  have  already  remarked  in  the  proof  of  Corollary  2.1  that  a(t)  s ae” 
for  positive  constants  a,P,  and  that  consequently  the  second  term  in 
(10.20)  is  negligible  compared  with  the  first  if  p is  sufficiently  small, 
t 2:  . Comparing  (10.19)  and  (10.20),  we  see  that  (10.2)  is 

certainly  satisfied  for  p sufficiently  small  and  t 2:  if 


^12  + I3  ^ 0 


(10.21) 


I nw-4'o)(t)Fj(e(t.  s).  Ti(t,  s)) 

2p  1 log  p l/K 


+ (w-4)p)(s)  F^C^t,  s),  Ti(t,  s))]  ds 


The  hypothesis  F(x,  x)  = 0 implies  that  Fj(x,  x)  = -F2(x,  x),  and  the 

bounds  on  the  first  and  second  partial  derivatives  of  F imply  that  the 

2 2 

first  derivatives  of  bounded  by  2M  /m  . Hence  the 

expression  in  [...]  in  (10.21)  may  be  written 


(10.22)  (w 


, ,r(w-<.o)(t)  F (^(t,  s),Ti(t,s))  \ 

- ‘t>o){s)  Fj  ( e(t,  s),  n (t,  s))  g j + Fj(e(t.s).,l(t,  s))  J 


\ I 2M^  I I 

(w-<}>q)(s)  (t,s).Ti(t.s))  / ^ ^ (t.s)-  n{t.  s)l 
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From  the  application  of  the  mean  value  theorem  In  which  they  arise,  we 
see  that 

e(t,s)  = «|.Q(t)  + 0(t,  s)  (w-4.Q){t), 

Ti(t,  s)=  <j>Q(s)  + e{t,  s)(w- <|)p)(s), 

where  0 < 0 < 1 • 

Thus 


||(t,S)  -Ti(t,  S)|  :<  Uo(t)  - ‘l>o(s)|  + I (w- 4>o)(t) -(W-4.jj)(S)|  , 


and 


( w - «^Q)(t) 
(w  - 4*0)  ( s> 


1 - 


Zl/i 


1 1 (t,  s)  - ■na,  s)  1 


(10.23) 


m 

(w-«|)Q)(t) 

(w-<|)q){s) 


-1  (l-^(w-4>o)(s)) 

' m 


ZW 


m 


V*)  - 


But  since,  in  the  range  of  integration  with  which  we  are  now  concerned. 


2p  |log  ji  I / K s s s t , 


we  have 

(w-<|>Q)(t)  (Y{t)-Y(s))  p llog  111  + (g(0)-  <}.Q(0))(e'’^^/^"  - 

^ Y(s)p  I log  pi  + 0(p2) 
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If  (1  Is  sufficiently  small,  the  denominator  does  not  exceed 
■5-  Y(s)ji  llog  , and  using  (10.3)  we  have 


(w-4>p)(t ) 

(w  -4>p)(s) 


- 1 2:  y-  I exp  I 

+ oi^ , / e"^/^^do 


- 1 


ip  I log  til  s 


(10.  Z4) 


- ■f  ^ a(T)dT)  da 


+ O / 

p llog  p|  s 


.-K«r/p 


d a 


But  we  have  already  remarked  that  a is  bounded  below  by  a negative 
exponential,  so  that 

“ 3t 

a ( T ) > ae 


Further,  for  p sufficiently  small. 


-K(r/p  ^ g-3o- 


p I log  p I 


if  «r  2;  2p  I log  p 1 / K 


For  this  is  equivalent  to  saying  that 


(i_p)o.  > _ log  (p^llog  p|  ) 

= 2 I log  p I - log  1 log  p I , 


and  this  is  certainly  true  even  if  a = 2p  [log  p|  / K and  p is 


sufficiently  small.  Hence,  if  is  sufficiently  small  and  (still  at 

our  choice)  is  fixed  large  enough,  the  O-term  in  (10.24)  does  not 
exceed  half  the  first  term,  and  substituting  back  in  (10.23)  and  noting 
that  (w-4)p)(s)  is  small  for  s in  the  relevant  range  s s 2^i  | log  |i|/K 
and  (1  sufficiently  small,  we  see  that  the  right-hand  side  of  (10.23)  is 
not  less  that 

(10.25)  -|-1C  f^(  /*a(T)dT)do-  - 1 4,  (t)  - 4,  (s)  I . 

s <r  m 

Finally,  from  Theorem  2 (or  more  precisely  Remark  3.1), 

t , 

^ f <t>Q(o-)dG-  1 

^ K ( f a(T)dT)  da-  . 

0* 

and  so,  if  Kj  is  fixed  sufficiently  large,  (10.25)  is  not  less  than 

li  I (I  a(T)  d-r)  do-  . 

s a 

Substituting  this  into  (10.22)  and  then  into  (10.21),  we  conclude  that 
Ij2  + I3  > 0,  and  the  theorem  is  proved. 
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ApponcUx  A ; Stntomnnt  of  physical  problem  nnd  fornnilntion 
of  mathcinntlcol  model 

Molten  plastics  commonly  exhibit  largo  plastic  recovery;  for  example, 

4c 

a filament  of  a certain  polyethylene  ('Melt  1'  at  150*C),  when  elongated 
at  a rate  of  1 cm/sec/cm  from  an  initial  length  of,  say,  1 cm  to  a length 
of  55  cm  and  then  released,  will  reach  a final  length  of  about  5 cm  [15]  . 
Such  clastic  and  other  rheological  properties  are  of  interest  in  the  process- 
ing of  plastics  and  rubber  and  also  as  examples  of  materials  with  'memory* 
The  correct  equations  for  describing  the  isothermal  behaviour  of 
a given  molten  plastic  are  not  yet  known.  One  set  which  has  been  used 
with  some  success  [8]  for  'Melt  1'  can  be  expressed  as  follows  in  body 


tensors: 

(Al) 

ir(P,t)  + p/\p,t)  = 

— + / a(t  - s)y  \p,s)ds 

.00 

(A  2) 

•^det  Y(P,s)  = 0 

(-00  < s < t)  ; 

(A3) 

• JlfPjt)  = P(£  - H)  : 

(A4) 

E{I(P,s)}  = 0 

(-«  < s < t)  . 

(Al)  is  the  so-called  'rubberlike  liquid'  constitutive  equation  relating 
the  symmetric  contravariant  stress  tensor  w(P,t)  at  particle  P and 
time  t to  the  values  of  the  reciprocal  Y ^ of  the  symmetric  covariant 


metric  lensor  V(P,s)  at  limes  In  llic  interval  -«><S£t  (see  (A12)  bck>w). 

The  mntcrlnl  properties  arc  determined  by  the  nonnegative  constant  and 

the  nonnegatlvc  constants  a,,  . . . ,a  • t,,  . . . , t In  the  *memor\'  function* 

1’  ' m’  1’  ’ m 

m 

(AS)  a(t)  = ^ a exp(-l/T  ) (a  > 0,  t > 0)  . 

r=l  ^ r r r 

(See  (A  26)  and  the  subsequent  explanation;  for  other  possible  constitutive 
equations  see  [1]  and  [5].) 

If  the  integral  term  were  omitted,  (Al),  with  the  constant  volume 
condition  (A2),  would  describe  an  incompressible  Nev/tonian  liquid  of 
viscosity  tj  and  could  be  used,  with  the  stress  equation  of  motion  (A3), 
to  derive  the  Navier-Stokes  equations:  there  would  be  no  elastic  recover^' 
possible.  £ and  H are  contravariant  vectors  describi.ng  acceleration 
and  body  force  per  unit  mass,  respectively;  p denotes  the  density, 
p is  a scalar  function  of  P and  t , of  the  nature  of  a hydrostatic 
pressure,  introduced  in  conjunction  with  the  incompressibility  condition 
(A2).  ^ in  (A3)  denotes  the  covariant  derivative  operator  formed  with 
Y(P,t),  and  the  dot  denotes  contraction  [8,  p.  193]  . (A4)  expresses 


the  vanishing  of  the  fourth  rank  Riemnnn-Christoffel  curvature  tensor  R 
constructed  with  Y(P,s);  this  expresses  the  fact  that  the  body  manifold 
is  Euclidean  at  each  instant  s and  so  admits  a body  coordinate  system 
tliat  is  instantaneously  rectangular  Cartesian  at  s [8,  p.  202], 

Equations  (Al  - 4)  are  sufficient  in  number  to  determine,  in  principle,  • 

the  unknown  variables  w , y,  and  p when  the.rem-‘»ining  quantities  j 

1 

(together  with  suitable  boundary  and  Initial  conditions)  are  given.  j 
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When  referred  to  on  arbitrary  body  coordinate  system  B : (P)  — IR  , 
(Al)  and  (A2)  yield  equations 

(A6)  + p(e,t)Y*^(e,t)  = -T,  + / aft  - s)Y^^(e,s)ds  , 


(A7)  — det  Yjjfi.s)  = 0 (-«><s<t), 

1 2 

where  4 ^Is  and  in  similar  contexts  is  short  for  (4,4  ,4'), 

coordinates  of  P in  B;  Y^^,  Yjj  denote  components  of  tt, 

Y \ and  Y,  respectively,  i,j  =1,2,3.  For  an  arbitrary  B,  (A3) 

yields  a complicated  set  of  equations.  One  can,  however,  always  choose 

a B that  is  Cartesian  at  the  instant  t at  which  (A3)  applies  (i.e. , B 

Is  such  that  y -(4,1)  is  independent  of  ^ for  i,j  = 1,2,3),  and  a 

3 

space  coordinate  system  C : (Q)  -►  R that  is  rectangular  Cartesian; 

(A3)  then  yields  the  following  three  partial  differential  equations: 


li 

j,k=i  dC  34^ 


- X^)  (i=  1,2,3). 


The  motion  of  the  body  is  now  described  by  the  three  equations 

(A9)  x^  = f^|,s)  (i  =1,2,3), 

where  denote  the  coordinates  in  C of  the  place  0 occupied  by 


particle  P at  time  s.  X 
body  force  per  unit  mass. 


denotes  the  components  in  C of  the  external 


(A4)  yields  a very  complicated  set  of  nonlinear,  second  order  partial 
differential  equations  in  Yjj(4,s)  whose  solution  is  of  the  form 
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(AlO) 


k=l 


(ij  =1,2.3) 

oi  ar 


for  arbitrary  B and  rectangular  Cartesian  C [3,11]  . We  may  thus  use 

the  three  functions  f^C,s)  in  place  of  the  six  functions  s)  (-  y..(i,s)) 

as  unknov/ns.  On  using  (AlO)  and  the  matrix  equation 

(All) 


(y^^^.s)]  =[v..{t,s)l'^ 


we  may  express  {A6)  and  (A7)  in  terms  of  -n  , f^,  and  p;  on  substituting 
the  resulting  expressions  for  into  (AS),  we  finally  obtain  three  non- 

linear, partial-integrc-differential  equations,  which,  with  the  single 
equation  resulting  from  (A7),  yields  a set  of  four  equations  for  the  four 
unknown  functions  f^,  p:  the  independent  variables  are  c^,  s. 

The  final  equations  are  nonlinear  in  f^,  p although  the  aibberlike 
liquid  constitutive  equation  (Al)  is  linear  in  the  tensors  ir,  Y Tlie 
nonlinearity  comes  from  the  constant  volume  condition  (A2)  and  from  the 
zero-curvature  condition  (A4)  whose  solution  (AlO)  is  quadratic  in  the 
unknown  functions  f^.  The  nonlinearity  arising  from  the  products  in  the 
left-hand  side  of  (AS)  can  be  removed  trivially  by  choosing  B to  coincide 
with  C at  time  t,  so  that  f^(4,t)  =4^  and  = 6^^  at  time  t. 

A very  considerable  simplification  cf  the  above  equations  is  obtained 
for  flow  histories  which  are  homogeneous  (or  uniform)  under  conditions  in 
which  the  inertial  and  body  force  terms  on  tlie  right-hand  side  of  (A3)  can 
be  neglected.  Such  histories  are  of  little  or  no  interest  In  classical 
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hydrodynamics  (whore  the  constitutive  eqUv-\tions  arc  given  by  (Al)  with 
a(t)  H 0)  but  are  of  fundamental  importance  in  polymer  rheology  where 
highly  viscous  molten  plastics  can  be  subjected  to  uniform  elongation  in 
filament  form  or  to  two-way  stretching  in  sheet  form;  results  of  carefully 
controlled  experiments  of  this  type  can  be  used  to  test  the  applicability  of 
constitutive  equations  such  as  (Al).  A flow  history  is  homogeneous  if, 


for  any  two  instants  s,  t,  we  have  V^Y(P,s)  =0  [8,  p.  247],  from 

-1, 


which  it  can  be  shown  that  (P,s)=0;  since  V commutes  with 

the  operators  d/dt  and  a(t  - s).  . .ds,  it  follows  from  (Al)  (taking 

p to  be  independent  of  P,  as  a trial  solution)* that  V^(P,t)  = 0 (show- 
ing that^he  stress  is  homogeneous)  and  hence  also  that  {A3)  (with  the 
right-hand  side  zero)  is  satisfied.  It  also  follows  from  the  above  defini- 
tion of  a homogeneous  flow  history  that  a body  coordinate  system 
B : P ^ exists  that  is  Cartesian  in  every  state,  i.e.  , which  is  such 
that  is  independent  of  i for  all  s [8,  p.  247],  and  hence 

(A4)  is  satisfied.  The  behaviour  in  homogeneous  flow  histories  with  inertial 
and  body  forces  neglected  is  thus  governed  by  (A6)  and  (A7)  with  4 
absent,  and  there  is  no  longer  a need  to  introduce  a space  coordinate 


system  or  the  functions  f^ : one  can  instead  use  v . or  as  the 

'ij 


unknowns,  for  example,  in  the  case  of  problems  involving  the  calculation 
of  free  elastic  recovery:  In  such  problems,  the  flow  history  (and  hence 
Yjj(s))  would  be  specified  throughout  some  Interval  -«>  < s < t^,  say: 
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for  < s < t,  the  stress  would  be  zero,  and  tlie  clastic  recovery  would 

be  determined  by  solving  the  set  (A6)  (wltli  = 0)  and  (A7)  for  p(t) 

and  (t^  < s < t).  These  equations  tlius  form  a simultaneous  system 

of  nonlinear  Volterra  Integrodifferential  equations,  in  which  the  nonlinearity 
arises  from  the  Incompressibility  condition. 

In  this  paper,  we  consider  the  particular  case  of  the  above  in  which 
the  specified  flow  history  is  one  of  simple  elongation  (at  constant  volume). 
The  variable  p can  be  eliminated,  and  the  recovery  behaviour  is  then 
governed  by  a single  nonlinear  Volterra  equation,  which  we  now  derive. 

In  any  B,  the  separation  P^P  at  time  t between  any  two 
neighbouring  particles  P^,  P is  given  by  the  equation 


(A12) 


= EEv,,(j.t)ei‘6s’ 

i J IJ 


where  and  + 64^  sire  the  coordinates  in  B of  P^  and  P. 

Tor  any  two  times  s,  t,  there  are  three  material  lines  through  any  given 
Pp  that  are  mutually  orthogonal  at  s and  at  t;  in  the  strain  s — t, 
infinitesimal  material  line  elements  tangential  to  these  three  material 
lines  at  P^  change  in  length  by  factors  X^(P^,s,t)  which  are  given  by 
the  positive  roots  in  X.  of  the  equation 


(A13) 


The  factors  are  called  'principal  elongation  ratios'.  A flow  is 
'shear  free'  If  there  exists  a body  coordinate  system  B that  Is  always 


1 
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orthogonal,  i.e.,  such  that  = 0 when  1 / J [8,  p.  81],  For  a 
shear-free  flow,  the  principal  clongotion  ratios  arc  given  by  tlic  length 
changes  of  the  coordinate  lines,  and  the  roots  of  (A13)  arc  given  by 


(A14)  t)  (i  = 1,2,3)  . 

For  a shear-free  flow  that  is  homogeneous,  B is  always  Cartesian,  i.e. , 


•r 

the  Yjj  are  Independent  of  | . A shear-free  flow  is  a 'simple  elongation' 


if  two  principal  elongation  ratios,  X.  and  X say,  are  always  equal; 


.1 


the  i -coordinate  lines  are  then  called  directions  of  elongation.  The 


constant  volume  condition  (A7)  reduces  to  the  equation  ^^^2^3  “ ^ 


hence,  for  simple  elongation  at  constant  volume,  we  have 

1 


(A15) 


X,  = X,  = X 2 

2 3 1 


We  now  consider  the  following  problem: 

-00  < s < - t^  Zero  stress:  no  flow;  B rectangular  Cartesian;  hence 


(A16) 


= 0 


» ^ij  Y "^ij 


-tg  < s < 0 


Homogeneous  simple  elongation  at  constant  volume  and 


constant  rate  k,  i.e.. 


(A17) 

0 < s < t 


dX^(-tg,S) 


ds 


Zero  stress;  free  elastic  recovery: 


(A18) 


= 0 . 


And  hence  X^  are  Independent  of  Pg. 
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Wo  wish  to  calculate  y^j(6|S)  for  0 < s £t.  As  a trial  solution, 

It  Is  reasonable  to  suppose  that  tlic  elastic  recovery  will  involve  a homo- 
geneous simple  elongation  at  constant  volume  with  the  |^-coordinatc  lines 
again  as  directions  of  elongation  (or  contraction).  For  convenience,  we 
write 

(A19)  y(s)  =\j(-tQ,s). 

Since  the  entire  flow  history  is  a homogeneous  simple  elongation  at 
constant  volume,  we  have,  from  (All),  (Al4),  (A15),  (A16),  and  (A19), 


(A24) 


- / a(t  - s)r(y(t),y(s))as  (t  > 0)  , 

» 00 

where 

(A25)  (1  = 3n,  F(y,z)  = (yVz^)  - z . 

These  are  tlie  equations  used  in  the  text  above.  Finally,  we  add  some  brief 
remarks  about  the  physical  basis  and  applicability  of  tlie  rubbcrlike 
liquid  constitutive  equation  (Al)  which  has  been  discussed  elsewhere 
[8,  pp.  143,  223-236] . 

(Al)  has  been  derived  from  two  different  molecular  theories:  the 
'bead-spring'  theory  of  Rouse  and  Zimni  for  very'dilute  solutions  of  deform- 
able long  molecviles  in  an  incompressible  Newtonian  solvent  of  viscosity 
T),  and  the  network  theory  of  Green  and  Tobolsky,  Yamamoto,  and  Lodge 
which  is  developed  for  concentrated  polymer  solutions  and  undiluted  or 
molten  polymers.  It  is  curious  that  two  different  molecular  theories 
should  yield  constitutive  equations  of  the  same  form,  but  the  reason  for 
this  is  known:  the  differences  between  the  tv/o  sets  of  equations  at  Lhc 
molecular  level  do  not  survive  the  averaging  process  used  to  go  from  the 
molecular  quantities  to  the  macroscopic  quantities  tt  and  ^ which 
appear  in  the  constitutive  equation  (/\1)  [9]  . The  memory  function  constants 

a T are  specified  in  terms  of  three  unknown  constants  by  the  bead- spring 

r’  r 

theory  but  are  not  specified  by  the  network  theory,  which  also  leaves  t) 
unspecified. 
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According  to  the  network  theory,  the  inlcgi\nl  term  in  (Al)  arises  from 
the  tlicrmal  motion  of  a network  composed  of  long,  deformable  polymer 
molecules  temporarily  linked  together  at  o few  points  called  entanglements 
or  temporary  junctions  which  are  assumed  to  bo  created  and  lost  ot 
const->i\t  rates  which  are  unaffected  by  the  flow  history.  The  concentration 
N(t)dt  of  network  strands  which  were  created  in  the  interval  {0,dt) 
and  are  still  in  the  network  at  time  t (a  'strand'  being  that  part  of  a 
polymer  molecule  lying  between  two  consecutive  junctions)  is  given  by  an 
equation  of  the  form 


m 


(A  26) 


N(t)  = ^ C exp(-t/T  ) 
r=l  ^ 


where,  for  simplicity,  it  has  been  assumed  that  the  set  of  all  strands 

can  be  sorted  into  m subsets,  labelled  1,  2,  . , . ,m,  such  that,  in  the 
th  , , , 

r subset,  all  strands  were  created  at  the  same  rate  C (per  millilitre) 
and  have  the  same  probability  1/t^  per  second  of  leaving  the  network 
[10]  . The  memory  function  in  (Al)  is  given  by  the  equation  a(t)  = kTNt'tf, 
where  k is  Boltzmann's  constant  and  T is  the  absolute  temp^a'fure. 
Thus  a^  = kTC^  > 0,  and  (A5)  is  proved. 

According  to  the  network  theory,  then,  it  followy^hat  a(t)  > 0, 
because  there  is  always  a nonzero  concentratiop^f  strands  of  age  t, 
and  that  a'(t)  < 0 because  strands  of  a^#*^ t(>  0)  can  only  be  lost  and 
not  created:  strands  are  created ^d^age  0 only.  It  also  follows  from 
(AS)  that 


/ 


/ 
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(A27) 


is  nondccrcasing 


(k  = 0,1,2,..  .)  . 


and  a^^^  is  nondccrcasing J 

denotes  the  k*^  derivative  of  a(t).  ■(A27)  represents  the 
properties  of  a(t)  some  of  which  are  used  in  the  present  analysis. 

The  constitutive  equation  obtained  by  putting  rj  = 0 in  (Al)  leads 
to  the  'reduced'  equation  (1.6)  and  has  been  tested  for  'Melt  1'  by  comparing 


the  predlct^is  with  results  of  a series  of  experiments  performed  by 
^ei^sner  [13,15] . The  constants  in  (A5),  with  m = 5,  were  chosen  to 
fit  stress  growth  data  in  simple  elongation  at  low  rates.  (Al)  (with  = 0) 
then  gave  good  agreement  with  stress  growth  data  in  simple  elongation  at 
higher  rales,  with  elastic  recovery  data  following  elongation  and  following 
shear,  and  with  stress  growth  .data  in  shear  flow  [8,  pp.  225-231;  2;  4], 
provided  that  the  total  strain  from  rest  was  limited  to  moderate  values; 
at  higher  strains,  there  was  serious  disagreement  between  theory  and  data: 
the  predicted  stresses  and  the  predicted  recoveries  v/ere  greater  than  the 
observed.  The  present  analysis  of  the  elongational  recovery  problem  shows 
that  inclusion  of  the  term  in  q leads  to  a reduction  in  the  predicted 
recovery,  which  is  in  the  right  direction  to  give  better  agreement  with 
experiment.  The  term  in  q has  been  added  to  represent  the  possible 
effect  of  the  presence  of  a viscous  solvent  (in  the  case  of  a concentrated 
polymer  solution)  or  of  polymer  of  low  molecular  weight  (in  the  case  of  a 
molten  polymer).  The  possible  effects  of  such  a term  are  also  of  some 


Interest  In  connection  with  certoin  'fast-stroJn*  tests  of  the  Gousr.ljn 
network  hypothesis  which  have  recently  been  proposed  as  a possible 
method  of  testing  certain  of  the  network  theory  assumptions  wlion  separated 
from  the  others  [8,  pp.  231-236;  12).  It  is  recognized,  however,  that 
other  modifications  to  (Al)  are  requiied  if  belter  agreement  between  all 
the  predictions  and  data  referred  to  above  is  to  be  obtained  [ M)  . It 
should,  perhaps,  be  added  that  the  homogeneous  elongation  with  neglect 
of  inertial  and  body  forces  treated  in  the  present  analysis  repre.sents  a 
reasonable  idealization  of  the  conditions  of  Meissner's  elongation 

5 

experiments  on  'Melt  1':  a long  filament  of  high  viscosity  (5  x 10  poise)  was 
floated  on  a bath  of  an  inert  oil,  and  the  homogeneity  of  elongation  was  always 
checked  by  weighing  samples  into  which  the  filament  was  cut  after  elongation; 
the  variation  of  elongation  ratio  along  the  filament  was  about  3^^  or  less  [ 15] . 


Appendix  B : Upper  and  lower  solutions 

Our  purpose  Is  to  collect  the  results  on  IntcgrodlfforcntiaJ  incquulltics 
needed  In  the  proof  of  Tlicorcm  8.  These  arc  rather  similar  to  classical 
results  of  this  type  for  ordinary  differential  equations  and  Voltcrra  equations 
developed  e.g.  in  [6]  and  [19] . However,  our  situation  is  sufficiently 
different  that  it  is  simpler  to  give  an  independent  short  exposition 
of  what  is  used,  rather  than  to  apply  more  general  known  results. 

In  what  follows  let  D u(t)  denote  the  lower  left-hand  Dini 


derivative,  D u(t) 


the  upper  left-hand  Dini  derivative,  D^u(t) 


the  lower  right-hand  Dini  derivative,  and  D u(t)  the  upper  right-hand 
Dini  derivative  of  a continuous  function  u.  When  D^u(t)  = D^u(t), 

we  denote  this  common  value  by  u*  (t),  the  right-hand  derivative  of  u 

• » 

and  similarly  u'  (t)  denotes  the  left-hand  derivative  of  u. 

It  will  be  convenient  to  write  the  initial  value  problem  (1.1),  (1.  2) 
in  the  form 


(Bl) 


where 


-py’(t)  = / F(y(t),y(s))ds  + f (y)(t) 
0 


y(0)  = g{0)  (|ji  > 0:  0 < t < «)  ^ 


(B2) 


f (y)(t)  = / a(t  - s)F(y(t),g(s))ds  . 


-00 


Throughout  this  appendix  we  shall  assume 

(B3)  a(t)>0,  ac  L^D,«'),  F satisfies  (Hj.),  g satisfies  (H^)  . 


369 


Tlic  basic  result  needed  is  (compare  Theorem  1.2.1  of  [6]); 


Proposition  10.  let  tlie  assumptions  (D3)  bo  satisfied.  v,w  c C([  0,  y*: 


>0,  be  given  functions  satisfying  the  following  properties : 


v(0)  < w(0)  , 


-pD  v(t)  >f  aft  - s)F(v(t),v(s))ds+f  (v)(t)  (0  < t < y)  , 
0 


-pD _w(t)  < / aft  - s)F(w(t),w(s))ds  + f^fw)(t)  fO  < t < y)  . 


Then  vft)  < wft)  (0<t<y). 


Proof.  Define  the  set  Z = { t t [ 0,  y)  : wft)  < vft))  . If  Proposition  IB  is 


false  the  set  Z 4 <t)\  let  t^  = Inf  Z.  By  (B4),  t^  > 0 


v(t^)  = wftj),  vft)  < wft)  (0  < t < t^) 


Talcing  h < 0,  Ih  | small,  one  has  vftj  + h)  < wft^  + h)  and 


vftj  + h)  - vftj)  wftj  + h)  - wftj) 

h ^ h 


Taking  the  limit  inferior  as  h — 0 this  implies 


D vftj^)  > D_w(t^)  , 


and  therefore, 


-pD  vftj)  < -pD  wftj)  (p  > 0)  . 


Applying  this  and  vft^)  = w(t^)  in  inequalities  (B5)  yields  the  inequality 


1 1 

/ aftj  - s)F(vft^),  vfs)ds  < / aft^  - s)r(w(t^), w(s))ds 


On  the  other  hnnd,  the  definition  of  end  (B6j,  tonctticr  wlU) 
Uic  OGSuinplJon  < 0 in  Implies  that 

r(v(t^),  v(s))  > F(\v{t^),  w(g))  |0  < s < , 

so  that,  since  a(t)  > 0, 


(BS) 


1 

J a(t  - s)F(v(t  ),v{s))ds  > / a(t  - s)F{w(t,) . w{s))ds  . 

n ^ 1 i ’ 


1 ‘1 

a(tj  - s)F(v(t^),v{s))ds  > j 
0 0 

Ihus  (ES)  contradicts  (B/)  and  the  set  Z =0,  This  proves  Proposition  IB. 

Definition.  We  shall  sav  that  w ^ ^ upper  solution  of  the  initial  value 

problem  (Bl)  ^ 0 < t < y if  and  only  if  wt  C([^i,Y)jB),  w{0)>g(0),  (t) 

exists  on  ( 0 , y)  and 

t 

-pw'(t)  < J aft  - s)F(vv(t),  \v(5))ds  + f (\v)(t)  (0  < t < y)  • 

0 ® 

A similar  definition  holds  for  a lower  solution  with  the  ineeualities  reversed. 
We  remark  that  as  a consequence  of  Theorem  3 the  solution  of  the 

reduced  equation  (1.6)  (recall  that  it  was  p.-oved  in  Theorem  3 that 
<!>Q(t)  < 0 (0  < t < CO  ))  is  a lower  solution  of  (1. 1)  (or  (Bl))  on  C < t < oo  . 

The  main  result  for  the  application  in  Tneorem  8 is; 

Proposition  2B.  Let  the  assumptions  (B3)  be  sa^tijfieji.  Let  6 be  the 


In  Theorem  8 one  takes  v(t)  = <)>Q(t),  where  4>q  Is  the  solution  of  (1.6), 
and  one  shows  that  w given  by  (10*  1)  is  an  upper  solution. 


Proof  of  Proposition  2B.  We  shall  prove  the  second  inequality  in  (B9); 
the  first  is  proved  in  a similar  way.  Since  w(0)  > g(0)  the  result  follows 
directly  from  Proposition  IB  with  v replaced  by  4),  D_v  by  <})',  and 
D_w  by  w'^(t)  (and  one  uses  Lemma  1.2.2  of  [ 6]  ) • 


I 
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VIBRATIONS  OF  A HELICOPTER  ROTOR  BLADE  USING 
FINITE  ELEMENT-UNCONSTRAINED  VARIATIONAL  FORMULATIONS 

J.  J.  Wu  and  C.  N.  Shen 
Benet  Weapons  Laboratory 
Watervliet  Arsenal 
Watervliet,  New  York  12189 


ABSTRACT.  In  the  past  several  years,  a numerical  method  has  been 
developed  which  is  a generalized  Rayleigh-Ritz  - finite  element  discretiz- 
ation using  the  combined  concept  of  Lagrange  multipliers  and  adjoint 
variables.  This  approach  enables  one  to  deal  with  problems  associated  with 
nonconservative  forces,  coupling  effects  and  all  types  of  boundary  conditions 
in  a routine  fashion;  and  it  appears  promising  in  solving  the  vibration  and 
dynamic  stability  problems  associated  with  the  complicated  equations  of  a 
helicopter  rotor  blade.  This  paper  presents  the  first  application  of  the 
general  method  to  the  vibration  problem  of  such  a rotor  blade. 

The  basic  differential  equations  in  this  paper  are  taken  from  the 
linear,  but  fully  coupled  set  developed  by  Houbolt  and  Brooks  in  1956. 

These  equations  are  further  reduced  to  a simplest  possible  case  and  yet 
still  containing  the  coupling  of  flap  and  root  torsion  modes.  An  uncon- 
strained, adjoint  variational  statement  has  been  established  which  is  both 
the  necessary  and  sufficient  condition  for  the  coupled  differential  equa- 
tions and  some  general,  but  physical  meaningful  boundary  conditions.  The 
finite  element  matrix  equations  are  tlien  derived  from  this  variational 
statement  illustrating  the  way  that  coupling  terms  could  be  handled  in 
general . 

The  numerical  results  from  some  demonstrative  examples  show  that 
instability  of  flutter  can  occur  in  the  range  of  operational  rotor  speed 
due  to  the  coupled  motion  of  flapping  and  root  torsion  without  any  aero- 
dynamic force,  if  the  torsional  spring  Coi"  ^he  pitch  control  link)  is 
not  sufficiently  stiff.  This  instability  does  not  appear  to  have  been 
reported  previously. 

1.  INTRODUCTION.  An  analytical  investigation  on  vibrations  and 
dynamic  stability  of  helicopter  rotor  blades  usually  consists  of  two  phases 
(1)  the  derivation  of  the  governing  differential  equations  to  include  para- 
meters and  variables  considered  physically  important,  and  (2)  the  formulation 
of  solutions  for  the  equations  derived  and  data  interpretations.  Ihis  paper 
deals  with  the  second  phase  of  such  an  investigation. 

Due  to  the  slenderness  of  a helicopter  rotor  blades,  its  aerodynamic 
cross-section  and  the  requirements  on  the  craft’s  maneuverability,  tliere 
are  a large  number  of  interacting  parameters  and  the  resulting  differential 
equations  are,  as  a rule,  nonlinear,  coupled  in  terms  of  field  variables. 

In  addition,  the  aerodynamic  forces,  coriolis  forces  due  to  rotation  of  the 
blade  arc  nonconservative  in  nature  and  the  effects  due  to  structural  damp- 
ing and  various  boundary  conditions  must  be  evaluated. 


Considerable  attention  has  been  given  recently  to  the  derivation  of  a 
consistent  set  of  nonlinear  differential'  equations  together  with  aero- 
dynamic forces.  This  fact  is  amply  demonstrated  by  the  work  of  Friedman 
and  Tong  [1]  and  that  of  Hodges  and  Ormiston  [2].  . In  both  references  [1] 
and  [2],  brief  reviews  on  earlier  work  on  helicopter  blade  equations  can 
be  found.  As  for  obtaining  solutions  to  these  equations  there  does  not 
appear  to  exist  a general  and  efficient  method  to  deal  with  the  difficulties 
associated  with  nonlinearities,  nonconservative  forces,  coupling  terms, 
various  boundary  conditions,  damping  effects  and  periodic  excitations.  For 
example.  Miller  and  Ellis  [3],  Ham  [4]  and  Friedman  and  Tong  [1]  have 
obtained  approximate  solution  by  including  in  their  solution  formulation 
only  the  lowest  modes  of  vibrations,  Hodges  and  Ormiston  [2,5]  employed 
Galerkin  technique  in  their  numerical  examples.  One  of  the  disadvantages 
of  this  approach  is  its  inability  to  handle  general  boundary  conditions. 

Using  the  combined  concept  of  adjoint  variable  and  Lagrange  multipliers, 
variational  statements  can  be  established  for  a wide  range  of  linear  prob- 
lems with  nonconservative  forces  and  very  versatile  boundary  conditions  [6] . 
Thus  a generalized  Rayleigh-Ritz  approximation  scheme  can  be  established 
for  the  obtaining  of  solutions  of  these  otherwise  difficult-to-solve  prob- 
lems. In  conjunction  with  finite  element  discretization,  this  approach 
has  been  amply  demonstrated  in  such  applications  as  nonconservative  stabil- 
ity, damping  effects  and  very  general  boundary  conditions  [7,8,9].  In  the 
present  study,  the  solution  formulation  is  limited  to  the  blade  vibration 
considering  only  the  coupling  of  flapping  and  root  torsion.  Although  the 
present  method  can  conceivably  be  extended  for  solutions  of  nonlinear  prob- 
lems, it  is  desirable  first  to  have  a thorough  understanding  to  solutions 
of  linear  problems.  For  this  purpose,  one  can  go  back  two  decades  and  use 
the  equations  consistently  derived  by  Houbolt  and  Brooks  in  1956  [10].  The 
original  set  of  equations  was  derived  for  elastic  distributed  torsion.  It 
can  be  adapted  to  model  root  torsion  if  proper  boundary  conditions  are 
introduced.  This  is  shown  in  Sections  1 and  2.  The  physical  justification 
for  emphasizing  root  torsion  over  the  distributed  torsion  was  due  to  the 
pitch  control  link  at  the  inboard  end  of  the  blade  and  was  used  by  Miller 
and  Ellis  [3]  and  again  by  Ham  [4].  It  should  be  clear  that,  in  the 
present  formulation,  to  include  distributed  torsion  is  simply  a matter  of 
increasing  the  number  of  degrees-of-freedom  of  the  discrete  system.  The 
basis  of  the  solution  formulation  and  the  technique  of  handling  the  coupling 
terms  are  given  in  Sections  3 and  4.  Finally,  the  numerical  results 
obtained  indicate  that  "flutter  instability"  can  occur  simply  due  to  the 
coupling  effect  considered  without  any  aerodynamic  loads. 

2.  STATEMENT  OF  THE  PROBLEM  - DIFFERENTIAL  EQUATIONS.  As  a first 
step  to  demonstrate  the  application  of  the  unconstrained  variational  - 
finite  element  formulation  to  helicopter  rotors,  the  vibration  of  a rotor 
blade  considering  the  coupling  of  flap  and  root  torsion  modes  of  motion 
is  analyzed  (Figiire  1).  For  this  purpose,  the  linear  set  of  equations, 
derived  by  Houbolt  and  Brooks,  including  the  coupling  flap  and  distri- 
buted torsion  is  rewritten  here  [10]; 
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+ n*ra(k^_  - k*,  + ee-)<l)  + mk*  ^ + mew  = M (2) 

m2  ml  m 


where  w = w(x,t)  and  ((>  = (|)(x,t)  denote  the  flapping  deflection  and  distri- 
buted torsion  of  the  rotor  blade  respectively.  A prime  (')  denotes  differ- 
entiation with  respect  to  x,  the  coordinate  along  the  blade  elastic  axis 
and  a dot  (•)>  differentiation  with  respect  to  the  time  t.  Other  symbols 
are  defined  in  the  following  and  are  consistent  with  the  notation  in 
reference  [10]. 

El  = flexural  rigidity  of  the  cross  section. 

GJ  = torsional  rigidity  of  the  cross  section. 

Z = length  of  the  blade. 

e = chordwise  distance  between  elastic  axis  (E.A.)  and  centre 

of  gravity  (C.G.)  of  a cross-section,  positive  if  C.G.  is 
ahead  of  E.A. 

= chordwise  distance  between  E.A.  and  centre  of  tensile  area 
(C.T.)  of  a cross  section,  positive  if  C.T.  is  ahead  of  E.A. 

Oq  = chordwise  distance  at  the  root  between  E.A.  and  the  axis 

about  which  the  blade  is  rotating,  positive  if  E.A.  is  ahead. 

ky^  = polar  radius  of  a gyration  of  the  tensile  area  of  a cross- 
section  w.r.t.  E.A.  in  a cross-section. 

kjjjj  = radius  of  gyration  of  the  total  area  of  a cross-section  about 
the  major  neutral  axis  (axis  1-1  in  Figure  2). 

kjj^  = radius  of  gyration  of  the  total  area  of  a cross-section  about 
an  axis  perpendicular  to  the  major  neutral  axis  and  through 
E.A.  (axis  m2-m2  in  Figure  2). 

k = polar  radius  of  gyration  of  the  total  area  of  cross-section 

about  E.A.  (k*  = k*,  + k^-  ). 

m ml  m2 

= blade  angular  velocity. 

^ 2 

T = T(x)  = / mil  xdx,  which  is  the  tensile  force  at  location  x. 

X 

= aerodynamic  lift  per  unit  length  of  the  blade. 

M = aerodynamic  torque  per  unit  length  of  the  blade. 
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In  reference  [10],  a more  general  set  of  linear  equations  with  fully 
coupled  flap,  lag  and  distributed  torsion  modes  of  motion  were  derived. 

The  coupling  terms  have  been  shown  to  be  due  to  the  noncoincidence  of  the 
elastic  centre,  centre  of  gravity  and  tension  centre,  the  centrifugal  force 
and  the  built-in  angle  of  twist.  For  a rotor  blade  without  the  built-in 
twist  angle,  the  lag  mode  of  motion  is  uncoupled  from  the  general  set  of 
equations  and  the  remaining  coupled  equations  are  Eqs.  (1)  and  (2) 
considered  here. 

Since  only  the  "free  vibration"  of  the  rotor  blade  with  a coupling 
between  flexural  and  root  torsion  is  considered  in  this  paper,  the  terms 
due  to  aerodynamic  forces  are  set  to  zero  and  the  torsional  displacement 
is  only  a function  of  time  and  not  a function  of  x.  Thus 

M = = 0,  ((.  = (t)Ct)  (3) 

and  Eqs.  [1)  (2)  reduce  to  the  following: 

EIw""  - (Tw')'  + mw  - n^me(|)  + me$  = 0 (4) 

+ ee^)(|)  + mk*  (b  - Te.w" 
m2  ml  m A 

+ mn^exw'  + mew  = 0 (5) 

In  Eqs.  (4)  and  (5),  it  is  also  assumed  that  the  blade  has  constant 
E,  I and  m throughout  its  length.  To  simplfy  solution  formulations  as 
much  as  possible,  Eqs.  (4)  and  (5)  will  be  transformed  into  dimensionless 
forms  and  appropriate  dimensionless  parameters  will  be  introduced.  This 
process  will  also  facilitate  parametric  studies. 

Let 


w = ^ . $ = <t> 


where  the  constant  c has  a real  time  dimension  and  will  be  defined  later  in 
Eqs.  (12).  Eqs.  (4)  and  (5)  become 


and 


EU  3**^ 

£"  ax'* 


mn*k*<j)  + 


mk 


c 


1 3 

(T  1^) 

m£ 

a^w 

- mll*e^  + 

me  3*$  _ Q 

(7) 

£"  ax 

ax 

c^ 

at* 

c*  at* 

m 9^^ 

Te^L 

9^  w 

mil* 

me£  9*w  _ n 

(8) 

2 3£2 

3i* 

£ 

3x 

c*  at* 
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where 


-Ki 


Multiplying  Eq.  (7)  by  , one  has 

El 


A ni*LL  A Ai  £ A 2021 

ai"  ■ ax  4i  ax^  * ei  at*  " ei  t c* 


^ m^Ae 
EI  c*  I at* 


2 

Multiplying  Eq.  (8)  by  S—  ^^3 

in  K. 


2«2  •ii  T j.  ILsl  _ £!i  a^  - , e . aw  ^ e a*w 

C n J^*  $ g£2  jj,£^2  £ gjj2  ^ g * 3x  £ at*  “ ° 


Now  let 


c*  = 

EI 


'■f-'A  = T-'o  A 


k = li  iL  = ^ 

1 ^ i 


n = cn 

TW  = = in*Ci-i*) 

EI  2 


Eqs.  (10)  and  (11)  then  become 


~ - (T(i)^)  + ^ - n*i^  ♦ i = 0 

ax*  ax  ax  at*  a€* 


and 


+ k*  - T(x)  e»  + fi^ex  ^ + e ^ = 0 

m g£2  " ^ A g-2  gj  g2^2 


(14) 


With  all  quantities  in  Eqs.  (13)  and  (14)  in  dimensionless  forms, 
one  can  omit  the  bars  altogether  and  write: 


w’"'  - (Tw')'  + w - n*e(J)  + e^  = 0 


(IS) 


and 


+ k*  $ - Tey^w"  + + ew  = 0 (16) 

Furthermore,  it  is  assumed  that 

w(x,t)  = w(x)e^^  (17) 

Ht)  = 4.0^^ 

Thus  the  final  set  of  equations  upon  which  the  present  solution  formula- 
tions are  based,  is  the  following: 

w""  - (Tw*)'  + X*w  - n‘e4>  + X^eiJ)  = 0 (18) 


and 

ft^k^4)  + X*k*  (J.  - TeyyW"  + fi^gxw'  + X*ew  = 0 (19) 

3.  AN  UNCONSTRAINED  VARIATIONAL  STATEMENT  AND  BOUNDARY  CONDITIONS. 

Some  of  the  unique  features  of  the  unconventional  va’riational  formulation 
are  that  all  the  boundary  conditions  are  natural  boundary  conditions  and 
that  the  set  of  the  differential  equations,  together  with  all  the  boundary 
conditions,  is  the  direct  consequence  of  a variational  statement  and  vise 
versa.  The  construction  of  such  a variational  statement  is  simply  a process 
of  integration-by-parts  from  a bilinear  functional  of  the  original  differ- 
ential equations  multiplied  by  the  variation  of  the  adjoint  field  variable, 
into  some  other  bilinear  functional  with  lowest  possible  derivations  of 
both  the  original  field  variable  and  the  variations  of  the  adjoint  variable. 
With  a proper  choice  of  the  generalized  Lagrange  multipliers,  any  physical 
meaningful  boundary  conditions  consistent  with  the  physical  meaning  of  the 
given  differential  equations  themselves  can  be  resulted  from  the  variational 
statement  as  natural  boundary  conditions.  This  general  process  was  treated 
elsewhere  [7]  and  will  not  be  repeated  here.  Presently,  an  unconstrained 
variational  statement  will  be  given  which  leads  to  the  original  differential 
equations  and  a set  of  a very  general  boundary  conditions. 
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I,  = /^[w"w*"  + Tw'w*'  + x*ww*  + e(X*  - 
0 


+.  kjw(0)w*(0)  + k2W'(0)w*’(0)  (20c) 

and 

I_  = / [(f2^k*  + X^k^)4)({i*  + eAT'w'(j)*  + e(n^xw'4)*  + X^w4i*)]dx  + k-dxj)* 

2 0 ra  A i ^20d) 


where  a star(*)  denotes  the  adjoint  variables  and  the  variations  are  totally 
unconstrained.  The  Lagrange  multipliers  k^,  k2  and  k3  are  the  spring  con- 
stants, in  dimensionless  form,  for  deflection,  bending  and  torsion  at  the 
hub  (x  = 0)  respectively.  To  show  that  the  unconstrained  variational  state- 
ment of  Eqs.  (20)  leads  to  the  original  differential  equations  and  the 
necessary  boundary  conditions,  one  considers 

(61)  . = 0 (21a) 

w»9 


where  (61)^^^^  means  taking  the  variation  of  I with  w,(j)  not  varied.  Thus 

(61)  = (6Ii)  + (612)^  . ? 0 

w,9  w,4)  ^ w,(j) 


(21b) 


(6lj)^  ^ = J^[w'’6w*"  + Tw'6w*’  + X*w6w*  + e(X*  - n*)^6w*]dx 

+ kjw(0)6w*(0)  + k2w’(0)6w*'(0)  (21c) 

and 

(6l2)w.4.  " /q  ^ X'k*)(^6(|)*  + e^T'w’6(|.* 

+ e(f2*xw'  + X*w)6(|)  ]dx  + k2(})6(j)*  (21d) 


Performing  integration-by-parts,  one  arrives  at 


and 


- (TW)'  + X*w  + e(X^  - n^)({)]6w*dx 


+ w"(l)6w*'(l)  - [Wifi)  - T(l)w' (l)]5w*(l) 
- [w"(0)  - k2w' (0)]6w*' (0) 

+ [w"'(0)  - T(0)w'(0)  + kiw(0)]6w*(0) 


(22a) 


(fil-j)  = / [(«*k^  + A*k^)(j)  - e.Tw"  + e(fi*xw'  + X*w)]6())*dx 
^ w,(j)  Q m A 

+ [e^T(l)w«(l)  - e^T(0)w«(0)  + 


(22b) 


Thus  Eqs.  (21)  is  the  necessary  and  sufficient  condition  for  the  following 
differential  equations  and  boundary  conditions; 


D.E. 


w""  - (Tw')'  + X^w  + e(X*  - 0^)(|)  = 0 


(23a) 


(n*k*  + X^k2)())  - e^^Tw"  + e(n*xw'  + X*w)  = 0 


(23b) 


B.C. : 


and 


w"(l)  = 0,  W"  (1)  = 0 - 

w"(0)  - k2W*(0)  = 0 

W"  (0)  - ^ w'  (0)  ♦ kjw(O)  = 0 


- o^w' fOl  + 


n"W  (0)  + k3(|)  = 0 

Note  that  in  Eqs.  (24)  the  fact  that 

T(l)  = 0 

and  . 

T(0)  = i-  n* 


(24a, 24b) 
(24c) 

(24d) 


(24e) 


(25) 


has  been  used. 
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It  is  observed  that  the  boundary  conditions  of  Eqs.  (24c)  and  (24d) 
represent  very  general  support  condition's  at  the  hub.  The  special  case 
kj  = «o  and  k2  * 0 corresponds  to  a hinged  blade,  while  the  case  kj  = 
k2  = “>  corresponds  to  a hingeless  blade.  These  tvyo  special  cases  are  the 
only  ones  considered  in  the  literature.  The  boundary  condition  (24 e) 
indicates  a coupling  between  the  flexural  motion  w and  root  torsion  at 
the  hub  due  to  the  centrifugal  forces  and  the  noncoincidence  of  the  elastic 
axis  and  the  tension  axis. 

4.  MATRIX  EQUATIONS  FROM  FINITE  ELEMENT  DISCRETIZATION.  In  this 
section,  we  shall  briefly  describe  the  formulation  of  the  matrix  equation 
of  the  approximate  solution  from  the  variational  statement  given  in  the 
previous  section.  From  Eqs.  (21),  one  can  write 


or» 


= /^(w"6w*"  + Tw'6w*'  - efi*4.6w*)dx 

W,(J)  Q 

+ k,w(0)6w*(0)  + k2w’(0)6w**(0)  + /\w6w*  + e(|)6w*)dx 

0 

+ [n*k*(J)6({i  + e^iT'y,' 6((i*  + efl^xw' 6(j)*]dx  + 


+ X*  f^(k^(p6(p*  + ew6())*)dx 


(26) 


It  should  be  noted  that  Eq.  (26)  is  a quite  general  equation.  Various 
types  of  approximate  solutions  can  be  obtained  depending  on  the  choice  of 
the  coordinate  functions.  The  motivation  of  using  the  finite  element 
discretizations,  which  corresponds  to  the  choice  of  a set  of  piecewise 
analytic  functions,  is  twofold:  for  the  ease  of  extending  this  formulation 
to  problems  of  irregular  geometry  and  for  the  adaptations  to  general  finite 
element  computer  systems. 


In  the  present  formulation,  however,  only  blades  of  uniform  cross- 
sections  will  be  considered  and  the  elements  are  assumed  to  be  of  the  same 
length.  Thus,  one  introduces  a local  (element)  coordinate  5 which  relates 
to  the  global  (entire  blade)  coordinate  x such  that 


i = L(x  - i;i)  (27) 

where  L is  the  number  of  elements  and  i denotes  the  i-th  element.  Using 
the  notation 

wfi)(a  = w[x(5f^^)l.  etc.  (28) 

■ for  simplicity  andjioting  that 

dC  = Ldx 


w' (x)  = Lw^^^'(5),  etc. 
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(29) 


one  then  has,  from  Eq.  (26): 


► 

t 


(61)  =0 

w,4> 

= X {/^L»w(^)"(a6w*(i)'-(Od5 
i=l  0 

+ f^L  T^^5(C)w^^^'(5)6w*'^^\0d^  - /^6w*fi5d5} 

0 0 

+ kjw(^5(0)6w*(^^ (0)  + k2L*w^^^' (0)5w*^^^' (0) 

♦ X /^w(i)(Q6w*fi)(C)dC  + ^ /^6w*^^^(£;)dC} 

i=l  ^ 0 ‘'0 


♦ I {fiV(J)6(f*  f + e,  L /^T^^^'(aw^^^'(5)dC6<|)* 


/ JS.  yoifi  Y ] U(,  ^ u J 

i=l  ^0  ^ 0 


+ + (i  - l)lw^^^'(Qd5  Sd)*}  + k3(})6(|)* 

^ 0 

2 L • ' 

♦ XM  ^ <|>6d)*  + X f /^w^^\5)dC}6<|)* 

^ 0 i=l‘'  0 


where 


T'"'(C)  = - b[C"  + 2(i  - IK  * (i  - D*  - L^l 


At  this  point,  it  is  appropriate  to  introduce  the  shape  function  and 
generalized  coordinates.  Let 


(34) 


= I 6W*^^5^{L®/^a"a"’^d5  - bLt/^^^a’ a’'’''dC  ■»■  2(1  - l)/^5a'a'^d5 

i-1  0 0 0 


+ {L^  - (i  - 1)^}  /^a'a'^d5]W^^^} 
0 


+ 


I 6W  ^ •'  (-  2ebL)  / ad?  <(> 
i=l  ~ O" 


+ kj  a(0)a^(0)  + 6W*(^)  k2L^a' (O)a'^(O) 

+ XM  I /^aa'^d?  + f /^ad?  ()>} 

i=l  - O'-  . 0~ 

+ 6(j)  fi^k^(fi  + 6(j)  <}> 

+ 64)*  ^ 2(e  - e.)bL  /V'”^(C)d?  + (i  - l)/^’'^d? 
i=l  0 0~  ■ 

+ 64)*[k24>  + I f /^a'^d?  (36) 

*"  i=l  0~ 


^ « T T Ci) 

+ 64)  I 2(e  - e.)bL  (s^  + (i  - l)r*)  W 

i^l  A > - > 

+ 64)*  (n^k^  + k3)4)  + 64>*  ( I f + k24))  (37) 

i=l  ^ 

where  the  element  matrices  are  defined  as  follows, 

A = /^aa'^d?,  B = /^a'a''^dC,  C = /^a"a"^d5 

0~~  •'  0 ~ 0 

D = /^5a’a'''’dC,  E = /^C^a'a'‘^d5 

o--  ' o-" 

F = a(0)  a’^(O).  G = a' (0)a''^(0) 

p = /^  a dC,  r = /^  a'  d?,  s = a*  d?  (38) 

-p''  " 0 " 0 “ 


The  numerical  values  of  these  matrices  are  given  in  the  Appendix. 
In  terms  of  the  global  coordinates  defined  in  the  following 


One  has  from  Eq.  (37): 


(«I)  A = 0 

w,(}> 


= 6W*’^(K  + X^M)W 


hi  * ^^12 


^21  * ^^21  ^22  * ^^22 


(41) 


The  global  matrices  j , i,j  = 1,2  are  assembled  from  the  element 

matrices  defined  in  Eqs.  (38)  in  the  following  manner. 


6W*^KiiW  = 1 {L^(C  + bB)  + bL(2D  - B - E) 

i_l 

+ 2ibL(B  - D)  - i^bLB}W^^^  + 6W*^^^^{kjF  + k2L^G}W^^^ 

(42a) 

L * T 

6W*'^K^2'f’  = I (-  2ebL  p)(t) 

i=l 

(42b) 

6(t)*K2iW  = 6(1)*  1 2(e  - e^)bL[s’^  - r^  + ir'^]K^^^ 

i=l 

(42c) 

6(j)  K22<1>  = 6(1)* (n^k^  + k2)(l) 

(42d) 

6W*MjjW  = 1 6W*^’‘^^  i A 

i=l 

(42e) 

6W*Mi2(1>  = f P <}> 

(42  f) 

6(1)*M  W = 6(f*  If 

zi~  i=l  ~ ~ 

(42g) 

6^*M22(^  = 64)*  k^  4) 

(42h) 

i 

1 


i 


J 
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Since  6W  = {6W  64>  } in  Eq.  (41)  is  unconstrained,  Eq.  (41)  leads 


directly  to 


(iC  + X^M)W  = 0 


which  is  vhe  final  matrix  eigenvalue  equation  to  be  solved. 

5.  RESULTS  AND  DISCUSSION.  Prior  to  the  presentation  of  the  demon- 
strative  numerical  results,  it  will  be  worthwhile  to  make  some  observations 
on  the  nondimensionalized  differential  equations  (15)  and  (16)  and  the 
boundary  conditions  (24). 

i.  For  e = 0,  the  flapping  motion  w(x,t)  and  the  root  torsion  (j)(t) 
are  essentially  uncoupled.  The  eigenvalue  solutions  of  w reduce  to  that  of 
a rotating  beam  and  agree  well  with  the  available  data  of  Boyce,  DiPrima 
and  Handelman  [11].  The  torsional  vibration  frequency  has  only  one  eigen- 
value solution  and,  if  ey^  = 0 also,  it  varies  linearly  with  the  rotor  speed 

as  expected. 

ii.  For  e 0 and  e^  = 0,  the  motions  are  generally  coupled.  It  is 
observed  that  if  e and  ((>  both  change  sign,  the  equation  remains  unaltered. 
Thus,  as  far  as  eigenvalues  are  concerned,  they  depend  only  on  the  absolute 
values  of  e.  The  solution  of  4>  for  a negative  e,  however,  is  180°  out  of 
phase  compared  with  the  one  for  a positive  e of  the  same  magnitude. 

iii.  For  e ^ 0 and  e^^  0 the  motions  are  generally  coupled.  It  is 

observed  that  if  e,  e,  and  (p  all  three  change  signs,  the  governing  equa- 
tions remain  unaltered. 

Some  demonstrative  calculations  will  now  be  given*.  The  eigenvalue 
X is  generally  a complex  number. 

X = + iXj  (44) 

From  equations  (17)  and  (44),  it  is  clear  that  the  system  is  unstable  of 
divergence  when  Xj^  is  nonzero  positive  and  Xj  = 0.  IVhen  Xj^  = 0,  on  the 
other  hand,  Xj  (LAMBDA)**  represents  the  nondimensional  frequency  of 
vibration  and  it  can  then  be  plotted  against  the  nondimensional  blade 
rotating  speed  (OMEGA),  as  shown  in  Figures  3 through  7.  When  X is 
complex,  one  of  the  square  roots  of  X^  must  have  nonzero  positive  real 
part.  The  system  is  then  unstable  since  X appears  in  the  equations  only 
as  X^.  The  value  of  X generally  become  complex  as  the  two  branches  of 
the  frequency  curve  coelesce.  The  "critical"  speeds  can  be  located  in 
these  figures  by  noting  these  points  of  coelescence. 

*An  extensive  parametric  study  of  a rotor  blade  instability  in  vacuum  due 
to  the  coupled  flap- (root)torsion  motion  will  be  presented  in  a separate 
paper  forthcoming. 

**The  symbols  appeared  in  the  parentheses  are  those  used  in  Figures  3 
through  7. 


For  these  sample  calculations  here,^  the  parameters  used  are  typical 
for  a medium  size  rotor  blade  of  20  ft.  in  length,  for  example.  It  is 
further  assumed  that  (the  nondimensionalized  quantities) 

e(E)  = 0.003,  e^CEj,)  = 0 

k^l(KMl)  = 0.0025,  kjjj2(KM2)  = 0.0100 

The  boundary  conditions  at  the  hub  are  those  of  a hingeless  blade.  The 
values  of  k]^(K-l)  and  k2(K-2)  are  set  to  10®  as  approximations  to  infinity. 
The  torsional  spring  constant  k3(K-3)  has  been  set  to  zero. 

In  Figure  3,  taken  to  be  0.003.  Thus 

®A  ■®A 

Y (GAMMA)  = ~ = = 1-00  - 

Here  the  lowest  branch  of  Xj  is  essentially  for  the  torsional  motion.  The 
second  and  third  lowest  branches  are  essentially  the  first  and  the  second 
for  flapping  motion.  For  the  range  of  rotor  speed  shown*,  0 ^ ^ 25,  the 

coupling  is  not  sufficient  to  have  instability.  In  the  subsequent  figures 
as  ey^  = 0.0015,  0,  -0.0015  and  -0.003  (and  y = 0.5,  0,  -0.5  and  -1.0), 
the  effect  of  coupling  becomes  more  and  more  severe.  In  Figure  4,  the 
two  lowest  branches  of  eigenvalues  appear  to  draw  closer  compared  with 
those  in  Figure  3.  They  actually  coelesce  in  Figure  5 at  a critical 
speed  about  = 16.5.  As  ey^  continue  to  decrease  (increase)  in  algebraic 
sense  while  holding  e a positive  (negative)  constant  the  critical  speed 
of  flutter  instability  tends  to  decrease  and  thus  the  structure  becomes 
more  critical.  This  is  observed  in  Figures  6 and  7. 
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APPENDIX 


The  numerical  value  of  some  of  the  matrices  used  in  Section  4 are 
given  here. 
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Elasti 

Axis 


Figure  1 


Problem  Configuration.  (Elastic  axis  is  shown  to  coincide 
with  the  axis  of  torsion  in  the  figure.) 


I 


I 

T-A.  = Axis  of  Torsion 
E.A.  = Elastic  Axis  3 

hj 

C.G.  = Centre  of  Gravity 
C.T.  = Centroid  of  Tensile  Area  of  a Cross  Section 


Figure  2.  Pariur.eters  Related  to  Off-sets  of  Variouc  Axis. 
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FIGURE  3. 


Vibration  Frequency  vs.  Speed  of  Rotation  (c^=0.0030,  e=0.0030). 
No  Instability  in  the  Range  of  Speed  Shown. 
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nGUR£_^.  Vibration  Frequency  vs.  Speed  of  Rotation  (c^=0.001S,  e = 0.00.i0). 
No  Instability  in  the  Ranpe  of  Speed  ShovN-n. 
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FIGURE  5. 


Vibration  Frequency  vs.  Speed  of  Rptation  (ey^=0,  e=0.0030) 
Flutter  Instability  at  ^=16. 46. 
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FIGURE  6. 


Vibration  Frequency  vs.  Speed  of  Rotation  (e.=-0.0015,  e=0.0030). 
Flutter  Instability  at  0=5.80. 
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FIGURE  7.  Vibration  Frequency  vs.  Speed  of  Rotation  (e^=-0.0030,  c=0.0030). 
Flutter  Instability  at  fi=4.59. 
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ABSTRACT . The  paper  presents  a formulation  for  the  study  of  damping 
effects  in  dynamic  structural  problems  and  a specific  application.  A finite 
element  formulation  is  first  derived  from  the  versatile  unconstrained  vari- 
ational approach.  The  vibration  of  a rotating  beam  is  used  here  as  a con- 
crete example.  Viscous  damping  terms  at  the  support  can  be  present  due  to 
either  local  deflection  or  rotation.  These  terms  can  obviously  affect  the 
frequencies  of  the  rotating  beam.  They  are  easily  incorporated  in  the 
present  formulation  using  the  concept  of  unconstrained  variations.  Numer- 
ical data  will  be  presented  to  demonstrate  the  qualitative  as  well  as  quan- 
titative effects  on  the  vibratory  behavior  of  this  rotating  beam  due  to  such 
damping  terms. 

1.  INTRODUCTION.  The  applicability  of  the  unconstrained  adjoint  vari- 
ational statement  in  solving  nonconservative  stability  problems  has  been 
shown  in  a series  of  articles  [1-4].  The  problems  involved  the  stability 
of  beams  or  columns  subject  to  concentrated  or  distributed  tangential  loads. 
The  problems  are  solved  by  finding  the  variational  statement  associated  with 
the  differential  equation  and  they  are  rendered  unconstrained  by  incorporating 
the  geometric  boundary  conditions  into  the  variational  statement  through  the 
use  of  Lagrange  multipliers.  With  the  variational  statement  now  available, 
the  problem  is  discretized  and  solved  using  finite  elements.  Various  types 
of  external  forces  and  geometric  boundary  conditions  can  be  handled  using 
the  above  techniques.  It  is  the  purpose  of  this  paper  to  incorporate  into 
the  above-mentioned  formulation  the  effect  of  support  damping  and  to  examine 
its  effect  on  the  solution.  The  specific  problem  chosen  was  that  of  a 
rotating  cantilever  beam  of  constant  cross  section. 

This  problem  was  chosen  for  its  application  to  a simplified  helicopter 
blade  and  although  no  nonconservative  forces  are  considered  the  solution 
technique  outlined  above  is  applicable. 


The  effects  of  support  damping  on  the  vibration  response  of  beams  has 
been  investigated  by  others.  Fu  and  Mentel  [5]  and  Mentel  [6]  considered 
support  damping  due  to  viscoelastic  layers  applied  to  the  ends  of  a beam 
in  its  supports.  The  effect  of  translational  (axial)  damping  was  found  to 
be  of  the  same  order  in  terras  of  energ>’  dissipation  at  the  supports  as  that 
of  material  damping.  Material  damping  effects  were  found  to  stiffen  the 
beam  which  increased  both,  the  resonance  frequency  and  the  energy  dissipa- 


tion at  the  supports.  They  also  found  that  rotational  motion  dominates  the 
damping  properties  if  all  parameters  are  suitably  optimized.  Ruzicka  [7] 
presented  an  evaluation  of  the  resonance  characteristics  of  undirectional 


vibration  isolation  systems  including  directly  coupled  (Kelvin/Voight)  and 
elastically  coupled  (Zener)  damping  elements.  His  results  were  mostly  for 
the  Zener  model  and  he  found  that  resonant  frequencies  of  vibration  isola- 
tion systems  with  viscous  damping  may  increase  or  decrease  with  an  increase 
in  the  viscous  damping  coefficients  depending  on  the  stiffnesses  in  the 
system.  MacBain  and  Genin  treated  support  flexibility  in  a series  of 
papers.  Support  and  material  damping  was  introduced  in  [8].  The  support 
is  viewed  as  a complex  rotational  support  stiffness  based  on  bounds  for 
the  elastic  modulus  found  in  their  earlier  papers.  They  find  that  when  the 
support  damping  constant  is  an  optimum,  the  support  loss  factor  is  also  an 
optimum,  and  system  loss  factor  reaches  a maximum  value.  This  same  value 
of  the  support  loss  factor  is  also  that  which  critically  dampens  the  system 
in  free  vibration. 

The  results  presented  here  show  the  effects  of  support  damping  on  the 
flexural  frequencies  of  vibration  of  the  rotating  beam  with  both  deflection 
and  rotation  flexibility  at  the  support. 

2.  PROBLEM  STATEMENT.  The  geometry  of  the  problem  is  shown  in  Figure  1. 
The  beam  has  a constant  cross  section  of  area  A,  density  p.  Young's  modulus, 

E,  and  moment  of  inertion,  I.  The  beam  rotates  about  an  axis  fixed  at  one 
end  of  the  beam  and  is  flexibly  supported  at  that  end  by  a deflection  spring, 
k^,  and  a rotation  spring,  k2.  Viscous  dashpots,  cj  and  C2,  are  assumed  in 
parallel  to  the  deflection  and  rotation  springs,  respectively.  The  beam 
rotates  at  constant  angular  velocity,  Q.  S(0)  represents  support  reaction. 

The  differential  equation  governing  the  motion  is  given  by  [9] 

U""  - ^ [(£^  - x^)u']'.  P^u  = 0 (1) 

2E1  El 

and  the  boundary  conditions  are 


at  X = 0 


c k 

u"(0)  - — u'(0)  - — u'(0)  = 0 
El  El 

c 

u'”  (0)  4 — U(0)  4 — U(0)  - u'  (0)  = 0 

El  El  El 


at  X = £ 


u"C£)  = 0 
u'"  (£)  = 0 


The  differential  equation  and  boundary  conditions  are  rewritten  using 
dimensionless  variables  and  parameters  defined  by  the  following: 


_ u 
£ 


Cl  = 


. X rt  - Ji^pA£‘' 

£ ' ^ 2EI 

t = [-^ 

pA£ 

Ci£ 

^2 

1/2  ^2 
(EIpA)^^^ 

(EIpA)^/^£ 

ki  - ^2  “ 

El 

k2£ 

El 

,1/2 


(4) 


Time  is  removed  by  assuming  displacements  to  have  the  form 

u(x,t)  = uCx)e^^  (5) 

Then  the  differential  equation  becomes  [dropping  the  bar  symbol): 


u""  - Q[(l  - x^)u'] ' + X^u  = 0 

and  the  boundary  conditions 

r u"(0)  - (Xcp  + k-)u' (0)  = 0 

x = 0 < 

I u"'  (0)  + (Xci  + ki^)u(O)  - Qu'(O)  = 0 
f u"(l)  = 0 

X - 1 

(u"’  (1)  = 0 

The  eigenvalues,  X,  will  be  complex, 

X = Xj^  + iXj 

The  frequencies  will  be  given  by 


u = Xj 


pA£^ 


1/2 


(6) 


(7) 

(8) 


(9) 


(10) 


and  for  this  problem,  Xr  < 0,  i.e.,  the  real  component  of  the  eigenvalue 
is  negative  and  no  instabilities  should  exist. 

3.  VARIATIONAL  STATEMENT.  To  find  the  form  of  the  variational 
statement,  the  differential  equation  is  multiplied  by  an  arbitrary  variation 
of  the  adjoint  field  variable,  6v(x),  and  integrated  over  the  beam  length. 
Integration  by  parts  indicates  the  form  of  the  variational  statement  and 
the  natural  boundary  conditions.  The  geometric  boundary  conditions  are 
attached  with  the  values  of  the  springs  and  dashpots  playing  the  role  of 
Lagrange  mult i])l iers . The  variational  statement  is  finally  given  by 
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+ (Xc^  + kj^)u(0)v(0)  + (Xc2  + k2)u' (O)v' (0)  (12) 

Performing  the-  variation  of  J with  respect  to  u and  v,  one  can  arrive 
at  the  original  boundary  value  problem  as  well  as  its  adjoint.  In  this  case 
the  two  problems  are  identical. 

4.  FINITE  ELEMENTS.  To  solve  the  problem  using  finite  element  tech- 
niques, the  beam  must  be  divided  into  segments  and  the  nodes  defined.  The 
value  for  the  unknown  variable  within  each  element  must  then  be  expressed 
in  terms  of  the  nodal  values  of  the  function  through  the  use  of  interpo- 
lating shape  functions.  A global  expression,  or  matrix  is  then  formed  and' 
the  eigenvalues  found. 

The  procedure  begins  by  taking  the  variation  of  Equation  (12)  and 
allowing  the  variations  in  the  problem  variable,  6u(x),  to  be  zero. 


/ [u"6v"  + Q(1  - x^)u'6v'  + X^u6v]dx  + 

0 

(Xcj  + kj)u(0)6v(0)  + (Xc2  + k2)u' (0)6v' (0)  = 0 
The  beam  is  divided  into  L elements,  letting 


C = L {x  - i = 1,2,3. ..L 


be  the  running  coordinate  in  each  element.  Substituting  Eq.  (14)  into  Eq. 
(13); 

I ^ (L^  - [C  + (i  - l)]^)u^^^'6v^^^' 

i=l  0 

+ ^^uf^^6v^^^]dC  + (Xcj  + kj)u^^^(0)6v^’^ (0)  + (Xc2  + k2)* 

(0)6v^^^' (0)  = 0 (15) 

In  order  that  the  displacements  and  their  derivatives  within  an  element 
be  expressed  in  terms  of  their  nodal  values,  the  coordinate  vectors 


= vf^  vfh 


are  introduced.  u|  \ represent  the  displacement  and  slope  at  the 

left  end  of  the  itn  element  and  and  represent  deflection  and 

slope  at  the  rightepd.  A similar  interpretation  applied  to  the  adjoint 
coordinate  vector  The  transform  is  indicated  by  T. 

Hermitian  polynomials  are  used  to  relate  the  displacements  within  an 
element  to  its  nodal  values,  hence,  the  following  shape  function  is  assumed. 


a^(C)  = {1  - 35^  + 2^^  5 - 25^  + - 25^  + 5^}  (17) 


So  that 


vCi)(5)  = 


Substituting  Eq.  (18)  into  Eq.  (15) 


L "J* 

{L^C  + [QL  - § (i  _ i)2jb  - ^ E - 2(i  - 1)  ^ D + ^ A}6V^^^ 

+ (Xcj  + + L^(Ac2  + k2)U^^^^F6V*-^^  = 0 (19) 

where 

A = /^a(Qa’^(5)dC  , E = (C)d5 

0 0 

B = /^i'(C)i'^'(5)dC  , F = a'(0)^’r(0) 

0 

C = /^£"(C)i'^"(C)d^ 

0 

6 = /^  5'(C)i'^'(C)dS  . H = i(0)i'^(0)  (20) 

0 


I 


Regrouping  of  (19) , 


. xr'«  . s»>)«v'«  - 0 


where 


= A/L 


i = 1,2, .. .L 


i = 1 


= + CjH  + C2FL^ 

R^’-^  =0  i = 2,3, ...L 

= L^C  + QLB  - Q/L  E + k^H  + k2FL^  i = 1 


(21) 


(22) 


(23) 


= L^C  + QL[1  - 72(1  - 1)^]B  - 7 E - 2(i  - 1)QD/L  i = 2,3, ...L 

Li 


(24) 


Using  certain  continuity  conditions  between  the  element  nodal  values 


u'‘)  . u«-‘> 

1 3 


u'*>  . u»-‘> 

2 4 


vf)  . v“-'> 

1 3 

2 4 


(25) 


One  can  write 


where  now 


0^^^{X^[P]  + X[R]  + [S]}6V  = 0 


(26) 


II 

c 

uf) 

4 3 4 

- 

/I)  yd)  yd)  y(2) 

2 3 4 3 

4 3 4 

[P] » [R] » tS]  are  N x N matrices  (N  = 2L  + 2) . Since  6V  is  arbitrary,  the 
eigenvalue  problem  reduces  to 


U^^^{X^[P]  + X[R]  + [SD  = 0 
for  the  eigenvalues  of  the  problem. 


(27) 
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An  existing  subroutine  was  used  to  find  the  eigenvalues  which  required 
the  standard  eigenvalue  problem  form 

{[A]  + A[I]}U  = 0 • (28) 

The  equation 

{X*[A]  + X[B]  + [C]}U  = 0 (29) 

can  be  reduced  to  Eq.  (28)  by  defining 

W = XU 

This  leads  to  the  matrix  equation 


__[0J I [IJ_ 

-[A]‘^C]  I -[A)-^[B] 

which  is  in  the  required  format.  The  drawback  here  is  that  the  order  of 
the  matrix  has  been  doubled.  Equation  (31),  however,  is  the  form  used  for 
computing  the  eigenvalues. 

5.  RESULTS.  Figures  2 and  3 show  the  effects  for  zero  damping  at 
the  support.  Figure  2 shows  the  effect  of  the  rotation  spring  (k2)  only 
on  the  frequency  with  load  as  a parameter.  The  deflection  spring  is 
assumed  to  be  infinitely  stiff.  For  Q = 0,  the  beam  is  only  vibrating 
and  is  not  rotating.  One  can  see  that  a stiffening  effect  occurs,  i.e., 
the  vibrating  frequencies  increase  with  an  increase  in  the  rotation  spring 
constant.  The  frequencies  rapidly  approach  those  for  a fully  clamped  vibrating 
and  rotating  beam.  These  results  also  fall  within  the  bounds  computed  by 
Boyce,  DiPrima  and' Handelman  [10].  In  Figure  3,  the  rotation  spring  is 
assumed  to  be  infinitely  stiff  and  the  effect  of  varying  the  deflection 
spring  is  shown  for  different  loads.  Again,  in  general,  there  is  a stiffening 
effect  as  the  deflection  spring  value  increases.  For  very  small  values  of 
the  deflection  spring,  the  first  vibrating  frequency  decreases  slightly  for 
increased  loads,  although  only  Q = 0 and  200  are  shown.  As  kj  is  increased, 
there  are  cross  over  points  after  which  higher  loads  do  imply  higher 
frequencies. 

Figure  4 shows  the  effect  of  rotation  damping  at  the  support  of  a beam 
having  rotation  flexibility  at  the  support.  The  deflection  spring  is 
assumed  infinitely  stiff  and  the  deflection  dashpot  is  zero.  Tlie  figure  is 
for  a specific  value  of  the  rotation  spring,  k2  = 1,  and  shows  the  first 
two  eigenvalues  for  each  of  two  loads,  Q = 0 and  Q = 100.  A stiffening 
effect  is  found  for  increasing  damping  for  Q ^ 0.  For  Q = 0,  there  is  very 
slight  decrease  for  very  small  damping  values.  For  smaller  rotation  spring 
constants,  and  zero  load  frequencies  decrease  with  increased  damping  as 
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shown  in  Figure  4 by  the  portion  of  the, results  for  V.2  ~ These  results 

are  better  shown  in  Figure  S.  The  results  in  Figure  4 are  interesting  since 
one  would  expect  a decrease  in  frequency  as  damping  is  increased.  Stiffen- 
ing effects  due  to  damping  are  found  elsewhere  [5]  and  could  be  due  to  the 
manner  in  which  it  is  introduced  in  the  problem.  Figure  4 also  shows  that 
the  results  for  the  fully  damped  beam  are  approached  rapidly  as  damping 
is  increased.  Figure  5 shows  the  vibrating  frequencies  on  a complex 
plane  for  a beam  with  rotation  flexibility  and  damping.  The  load  is  zero 
(non-rotating  beam)  and  the  rotation  spring  is  kept  at  k2  = .1  while  the 
dashpot  value  changes.  The  arrows  adjacent  to  the  curves  show  the  direc- 
tion of  the  values  on  the  curve  as  damping  increases.  For  zero  damping, 
the  results  are  purely  imaginary  and  are  approximately  .54  and  15.5  for  the 
1st  and  2nd  frequencies.  As  damping  increases,  the  frequencies  become 
complex  with  the  imaginary  components  decreasing  for  the  first  eigenvalue 
and  increasing  for  the  second.  The  behavior  of  the  first  frequency  is 
interesting.  As  the  damping  value  increases,  the  imaginary  component 
vanishes  (as  also  seen  in  Figure  4)  as  if  the  system  becomes  critically 
damped.  However,  the  real  component  can  also  be  followed  on  the  complex 
plot  and  the  beam  appears  to  vibrate  again  in  this  first  mode  as  damping 
increases  further.  For  sufficiently  large  damping  the  frequencies  seem 
to  approach  those  for  a beam  which  is  damped  at  the  support.  Points  on 
the  real  axis  represent  zero  motion  but  move  with  changes  in  damping 
values  to  points  on  the  real  axis  where  it  is  intersected  by  a branch  or 
mode.  Figure  6 shows  the  same  results  for  load  Q = 25.  A final  result  for 
rotation  flexibility  is  shown  in  Figure  7 for  k2  = 10.  Here  the  rotation 
spring  is  relatively  stiff  and  the  fully  damped  results  are  rapidly 
approached  with  initial  effects  for  near  zero  dashpot  values  overshadowed. 


The  effect  of  support  damping  on  the  frequencies  on  beams  with 
deflection  flexibility  are  shown  in  Figures  8-10.  A decrease  in  frequency 
with  increased  damping  is  seen  here.  Figures  9 and  10  show  the  effect  on 
the  complex  plane  for  Q = 0 and  Q = 200,  respectively. 

Finally,  Figure  11  shows  the  results  of  a case  when  both  rotation 
spring  and  deflection  spring  flexibilities  are  allowed.  Little  effect 
on  frequency  is  noted  as  the  rotation  spring  is  varied  while  the  deflec- 
tion spring  and  dashpot  remain  constant  in  value.  An  almost  parallel 
increase  in  frequencies  are  found  when  the  rotation  dashpot  is  increased 
in  value.  The  investigation  into  the  response  of  the  beam  with  all  springs 
and  dashpots  finite  was  limited  to  those  shown.  A study  should  be  per- 
formed to  indicate  areas  where  the  effects  on  beam  response  will  be  most 
pronounced. 
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AN  EVALUATION  PROCEDURE  FOR  INCOMPLETE  GAMMA  FUNCTIONS 


Walter  Gautschi 
Mathematics  Research  Center 
University  of  Wisconsin-Madison 
Madison,  Wisconsin  53706 


ABSTRACT.  A computational  procedure  is  developed  for  evaluating  Tricomi's  in- 

complete  gamma  function  y (a,x)  = (x"®/r(a))  / e”^t^"  dt,  and  the  complementary 

0 

OD 

incomplete  gamma  function  r(a,x)  = / e~^t^"  at,  both  in  the  region  x ^ 0,  -»  < a < ® 

X 

Each  of  these  functions  can  be  obtained  from  the  other  by  means  of  simple  interrelation- 
ships. The  choice  of  primary  function,  i.e.,  the  function  to  be  computed  first,  will 
be  dictated  by  considerations  of  numerical  stability  and  computational  conveniti.ce.  In 

the  strip  0 £ x £ 1 .5,  -«>  < a < ® , the  choice  goes  to  y (a,x),  which  is  easily 
evaluated  by  Taylor's  series.  This  entails  certain  difficulties  for  r(a,x),  when  a 
is  very  close  (or  equal)  to  a nonpositive  integer,  but  these  can  be  dealt  with  by  a 

careful  analysis  of  the  limit  behavior  of  r(a,x)  as  a -m,  m = 0,  1,  2 The 

★ 

function  y (a,x)  continues  to  serve  as  primary  function  in  the  region  a ^ x ^ 1.5, 
where  it  can  be  effectively  evaluated  by  a continued  fraction  due  to  Perron.  In  the 
remaining  region  x ^ 1.5,  a < x,  the  primary  function  is  taken  to  be  r(a,x),  and 
is  evaluated  by  a classical  continued  fraction  of  Legendre. 

The  complete  paper  is  available  as  MRC  Technical  Summary  Report  #1717,  February 

1977. 


A METHOD  OF  EVALUATING  LAPLACE  TRANSFORMS  WITH 
SERIES  OF  COMPLETE  OR  INCOMPLETE  BETA  FUNCTIONS 
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Alexander  S.  Elder 
Emma  M.  Wineholt 
Propulsion  Division 
US  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland  21005 

ABSTRACT.  In  a previous  paper  factorial  series  were  used  to  calculate 
ordinary  and  modified  Bessel  functions  of  the  second  kind.  In  the 
present  paper  the  factorial  series  is  generalized  so  that  Laplace  integrals 
in  which  the  integrand  has  a branch  point  at  the  origin  are  represented 
by  a series  of  beta  functions.  To  effect  the  required  transformation, 
formulas  for  calculating  Stirling  numbers  of  fractional  order  were  derived; 
these  were  used  in  the  same  manner  as  the  Stirling  numbers  of  integer 
order  are  used  to  calculate  the  coefficients  of  a factorial  series. 

Formulas  for  calculating  Kq(x)  and  K^(x)  have  been  derived  and  programmed, 
using  these  modified  Stirling  numbers.  Formulas  for  calculating  Ip(x) 
and  Ij^(x)  have  been  derived  and  programmed  using  series  of  incomplete 
beta  functions  in  a similar  algorithm.  Results  for  and  K^(x)  agree 

to  thirteen  significant  figures  when  x>8  and  for  and  Ij^(x)  when 

x>15.  The  modified  Stirling  numbers  increase  very  slowly  with  order 
and  index  since  gamma  functions  do  not  occur  in  the  definition. 

Consequently  no  problems  with  overrun  of  the  electronic  computer  occurred 
during  the  course  of  the  calculations. 


1.  INTRODUCTION.  Factorial  series  for  Bessel  functions,  confluent 
hypergeometrlc  functions,  and  certain  other  special  functions  can  be  used 
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to  check  the  accuracy  of  calculations  for  these  functions  provided  the 
argument  is  not  too  small.  Generally,  if  a function  is  analytic  in  the 
right  half  plane,  including  the  imaginary  axis,  and  can  be  represented 
as  a Laplace  transform,  then  a factorial  series  can  be  derived  which  will 
converge  in  the  right  half  plane.  Buchal  and  Duffy  (1)  obtained 
factorial  series  for  Hankel  functions.  Coulomb  wave  functions,  and 
Mathieu  functions.  These  authors  also  studied  the  convergence  properties 
of  the  factorial  series  for  the  Hankel  functions  in  considerable  detail. 
Their  results  showed  that  factorial  series  were  more  accurate  than  Hankel 
asymptotic  series  for  the  same  argument.  The  analysis  was  based  on 
Bernoulli  polynomials  as  discussed  by  Doetsch  (2)  and  Milne  - Thomson  (3). 

The  analysis  in  this  paper  is  based  on  an  algorithm  of  Wasaw  (4) 
which  uses  Stirling  numbers  of  the  first  kind  to  calculate  coefficients 
for  the  factorial  series.  The  Stirling  numbers  of  the  first  kind 
increase  very  rapidly  with  order,  eventually  obtaining  overrun  in  the 
electronic  computer.  Moreover,  the  factorial  series  for  complex  argument 
is  awkward  if  there  is  a branch  point  at  the  origin.  Rosser  (5) 
obtained  a generalized  factorial  series  for  modified  Bessel  functions 
of  the  second  kind  by  direct  manipulation  of  the  Laplace  transform  and 
also  established  the  convergence  properties.  His  analysis  is  quite 
difficult  and  requires  a separate  treatment  for  each  case.  In  this 
paper  we  derive  Stirling  numbers  of  the  first  kind  and  fractional  order, 
leading  to  a generalization  of  Wasaw' s algorithm.  Overrun  in  the 
computer  is  eliminated  by  scaling  the  Stirling  numbers  of  integral  and 
fractional  order  by  omitting  the  gamma  functions  in  the  definition. 

- — 


A further  generalization  of  factorial  series  is  required  if  the 
Laplace  integral  representing  the  function  is  evaluated  between  finite 
limits.  If  a factorial  series  is  regarded  as  a series  of  beta  functions. 
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it  is  logical  to  represent  a Laplace  integral  with  finite  limits  in 
terms  of  a series  of  incomplete  beta  functions.  The  new  series  obtained 
by  this  method  is  convergent  even  though  the  corresponding  factorial 
series  may  be  divergent. 

2.  MODIFIED  STIRLING  NUMBERS  OF  THE  FIRST  KIND  AND  FRACTIONAL  ORDER 

Stirling  numbers  of  the  first  kind  are  defined  as  coefficients  which 
occur  when  a factorial  is  expanded  into  a polynomial  (6),  (7). 

X (x-1)  (x-2)  ...  (x-n+1)  = ^2  x™  (1) 

111=0 

Clearly  m must  be  an  integer  in  the  above  equation.  However,  a 
generating  function  involving  logarithms  is  not  subject  to  this 
restriction. 
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In  order  to  avoid  the  gamma  function  and  non-integral  Indices  we  define 
by  the  equation 
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^In  (l+x)j 


k.=o 


k 


(4) 


where  v may  take  on  fractional  as  well  as  integral  values.  When  the 
have  been  calculated,  Stirling  numbers  of  the  first  kind  may  be 
obtained  from  the  following  equations. 


- r (n+1)  / r (nri-1) 

n n-v 


(5) 


where  v = m 

k = n - V 


(6) 

(7) 


To  find  t divide  Eq  (4)  by  x^  and  evaluate  the  limit  of 
each  side  of  resulting  equations  as  x ->  0.  We  obtain 


W<'^)  - 1. 
o 


(8) 


We  can  also  prove  that 


- 0 
k 


, k > 1 


(9) 


and  W 


(o) 


(10) 


We  now  derive  a sequence  of  triangular  equations  for  calculating 
W^'’^  W^'*^  ....  W^'^^  in  turn.  Let 
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y(x)  = [An  (1+x)] 


(1 


and  u(x)  = (x+1)  An  (1+x). 


(i: 


Then  u(x)  y'(x)  = vy(x) 


(13 


But  y(x)  - £ U'"*  X 


y’(x)  = H (v+k)  X 


v+k-1 


(14) 


k=o 


.1  2 13.1  4 

u(x)  = x + -2  X 


...  + (-1) 


k X , k>l  (15) 


k(k-l) 


On  inserting  these  series  into  Eq  (13) , carrying  out  the  indicated 
multiplication  and  equating  coefficients  of  like  powers  of  x on  each 
side  of  the  resulting  equation,  we  find 


(V) 

L 
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Oil  re-arranging  these  equations  we  find 


i v=  0 


2 - | = 0 


3w('^)+:^  mCv).  „(v)  . v__ 

^ 3 2 2 6 1 12 


= 0 


(16) 

(17) 

(18) 


and  in  general 


, , 1-1 


'’1 


k=o 


(-1) 


1+k+l 


v+k 


.(V) 


(l-k)(l-k+l)  k 


(19) 


To  find  a recurrence  formula  Involving  different  orders,  differentiate 
Eq  (4)  with  respect  to  x and  then  multiply  both  sides  of  the  resulting 
equation  by  (x+1): 


V [In  (1+x)]  = X)  W^''^  (v+k)  (x+1)  X (20) 

k=o 

If  we  replace  v by  (v-1)  in  Eq  (4)  and  multiply  both  sides  of  the 
resulting  equation  by  v , we  find 


V [ln(l+x)]  = 


00 


z 

k“o 


„(v-l)  v+k-1 

W,  vx 


(21) 
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On  comparing  coefficients  of  like  powers  of  x in  the  last  two  equations 
we  find 


'^k+i  " f '^k+i^^  " ^ ^ 

Hence  we  can  calculate  from  and  Higher 

order  numbers  can  be  generated  in  succession  in  the  same  manner.  The 
values  of  must  be  calculated  from  Eq  (19)  before 

the  recurrence  formula  given  by  Eq  (22)  can  be  used;  a double  entry 
table  is  finally  obtained. 

Wasaw  uses  Schlomlich's  definition  of  factorial  coefficients  in 
his  development  of  factorial  series  (7)  : 

n-1 

X (x+1)  (x+2)  ...  (x+n-1)  = ^ x"  . (23) 

m=o 


It  follows  that  the  factorial  coefficients  and  Stirling  numbers  of  the 
first  kind  are  related  by  the  formula 


and  for  fractional  orders 


[ - «„  (l-x)  ]'>  . Z v'”’ 


(26) 


k=o 


Finally,  we  obtain  the  following  recurrence  formulas  in  the  manner 
indicated  previously; 


S,  = 0 

0-  (i-k)(X,-k+l)  ''k 


'^k+l  " ^ ^k^^  ^ ^ lv+k+1] 


(27) 


(28) 


3.  GENERALIZED  FACTORIAL  SERIES 

We  now  derive  an  extension  of  Wasaw's  algorithm  for  a Laplace 
integral  to  functions  with  a branch  point  at  the  origin.  The  branch 
point  involves  fractional  powers,  in  the  same  context  as  Watson's 
Lemma  (8);  logarithmic  branch  points  are  not  considered.  Assume 


F(x) 


00 

£ 


f(t)  dt 


(29) 


and  let 


then 


On  referring  to  Eq  (26)  we  see 


1 

F(x)  = i:  i:  / u ci-u)  ^-^du . oa) 

n=o  k=o  *'o 

Now 


B (a, 6) 


so  that 


(35) 


00  00 

F(x)  ” ^ ^ ^n  ^ (k+n-h;+l,x) 

n^o  k=o 


(36) 


k+n=ll  (37) 


and  define 


b = V a 

“j!.  2-(  ^n  £-n 

n=o 


(38) 


then 


F(x)  = E b B(v+Jl+l,x) 
i,=o  ^ 


On  noting  that 


(39) 


, , . „ , , V (v+1)  . . . (v+Jt) 

B(v+j!,+l,x)  = B (v,x)  (x-K,)(x-h)+l)...(x+v+j!,) 


(40) 


and  using  Pochhairaner ' s symbol  to  represent  the  factorials,  we  find 


F(x)  = B (v  ,x)  E ('')«,+!  / (^"^^ 


S,=o 


Jl+1 


(41) 


in  a formal  sense.  By  analogy  with  conventional  factorial  series, 
the  series  should  converge  in  a half  plane  which  lies  to  the  right  of 
the  imaginary  axis.  Details  of  the  required  analysis  will  not  be 
considered  at  this  time. 
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4.  ON  A SERIES  OF  INCOMPLETE  BETA  FUNCTIONS.  Since  a generalized 
factorial  series  is  in  fact  a series  of  beta  functions,  it  is  natural 
to  represent  a Laplace  integral  with  finite  limits  of  integration  as  a 
series  of  incomplete  beta  functions.  The  lower  limit  of  integration 
can  be  taken  equal  to  zero  without  loss  of  generality.  Assume 


F(x) 


tx 


f(t)  dt,  T>0 


(42) 


Let 


e = l-e~^ 


(43) 


then 


F(x)  =r  f[-J!,n(l-u)]  [1-u]  ^“^du 
•'o 


(44) 


On  referring  to  Eq  (26)  we  find 


00  00 


F(X)  = x:  E V, 


(v+n) 


n“o  k=o 


/ “ 


v+k+n.,  sX-1, 

(1-u)  du. 


(45) 


Since 


B^(a,e) 


(1-t) 


e-1 


dt 


(46) 
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we  find 


00  00 


F(x)  = ^ 

n=o  k.=o 


(47) 


F(x)  = £ B^(v+)l+l,x) 


1=0 


(48) 


on  referring  to  Eqs  (37)  and  (38). 

To  compute  the  incomplete  beta  function,  set 

v+f.=  6 

and  use  the  formula 


B (6+1, x)  = B(x,6+1)  - B,  (x,6+l) 
e i- 1 


(49) 


(50) 


The  beta  function  on  the  right  side  of  Eq  (50)  was  expressed  in  terms 
of  gamma  functions,  as  shown  in  Eq  (51): 


B(x,6+i)  = r (x)  r (6+1)  / r (x+6+i). 

The  gamma  functions  were  obtained  from  the  subroutine  CDLGAM  by 
H.  Kukl  (10) . This  subroutine  is  valid  for  both  real  and  complex 
values  of  x. 


(51) 
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The  Incomplete  beta  function  on  the  right  side  of  Eq  (50)  is  given 


by  the  integral  formula 

6+1)  = r ^x-1  _ 

•b 


(52) 


On  expanding  the  binomial  factor  in  the  Integrand  and  integrating  term 
by  term  we  find 


®l-e 


(1-e)^  _ A (l-e)'^'^^ 
x ll  (x+1) 


6(5-1) 

21 


(1-E) 


x+2 


(x+2) 


6(5-1)  (6-2)  (1-  , 

3!  (x+3) 


(53) 


This  series  is  satisfactory  if  6 is  small,  but  is  subject  to  round-off 
error  if  6 is  large  and  positive.  To  overcome  this  difficulty,  integrate 
the  right-hand  side  of  Eq  (52)  repeatedly  by  parts.  We  find 


(x,6+l) 


(l-e)’^  6 6(l-e)^'*'^  6-1 

X ^ x(x+l)  ^ 


6(6-1)  (l-t)^'^^  5-2  6(6-1)  (6-2)...  (6-m)  ^ 

x(x+l)(x+2)  x(x+l) (x+2) . . . (x+m)  m 


where 


R 

m 


^1- 


e 


(x+m+1,  6-m)  . 


(55) 


We  choose  m so  that 


1 < 6-m  < 2. 


(56) 


Then  the  terms  of  the  series  given  by  Eq  (54)  are  positive  when  x is 

real  and  positive,  and  consequently  the  round  off  error  should  be  small. 

The  remainder  R is  calculated  from  Eq  (53) . 
m 


R = 
m 


x+m+1 


(1-e)" 

x+m+l 


(6-m-l)  (6-m-2)  (1- 
2 1 (x+m+3) 


All  the  terms  of  this  series  after  the  first  term  are  negative,  and 
decrease  rapidly  in  magnitude.  Hence  the  error  in  calculating  R^  should 
be  small. 

5 . CALCULAT IONS . We  calculated  the  modified  Bessel  functions 
K^(x),  Kj^(x),  I^(x),  and  (x)  from  the  integrals: 


V (x)  = — ^ — 

o^*^  r(i) 


e‘’'*^(t^+2t)"^dt 


’Ki(x)  - ^ J e‘’'*'(t^+2t)*dt 


(58) 

(59) 


V = i (63) 


in  Eqs  (59)  and  (61).  Hence  the  modified  Stirling  numbers  Vj^  and 
4-fm 

Vj^  are  required  for  the  coefficients  of  the  factorial  series.  A 
short  table  of  these  numbers  is  given  in  Table  1. 

I 

Next,  the  coefficients  a^  for  the  series  expansions  of  (1+^t) 

(1-it)  (1+it)^,  and  (1-H)^  were  calculated  from  the  appropriate 

recurrence  formulas,  as  shown  in  Table  2. 

The  coefficients  b^^^  were  calculated  from  Eq  (38)  for  each  of  the 
four  cases  listed  above.  In  addition,  the  partial  sums 

m 

E n <“> 

.^v=0 

were  calculated  in  order  to  study  the  convergence  of  the  factorial 

00 

series.  The  series  ^ C must  converge  if  the  corresponding  factorial 

ra 

series  is  to  converge.  This  condition  is  apparently  violated  for  the 
functions  Iq(x)  and  Ij^(x),  which  shows  why  the  factorial  series  for  these 
functions  apparently  diverged.  These  results  are  given  in  Table  3. 
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515eseiC27E  C4  -0.242139C452E  03  -0.3988377918E-03  0. 8367300496E-02 
5932S5C354E  C4  -0. 2749495588E  03  -0,392e963534E-03  0. 82702634 58E-02 
68234591928  C4  -0. 3122761345E  03  -0.3871123162E-03  0.8175449148E-02 
70481O64O6E  C4  -0. 354747e056E  03  -0.38147974356-03  0.80827e2095E-02 
9027163686E  C4  -0.4030820658E  03  -0.3759929875E-03  0.799219C2C0E-02 


1 


The  beta  functions  and  Incomplete  beta  functions  required  in  the 
series  expansions  for  K^(x),  Kj^(x),  I^(x),  and  Ij^(x)  were  calculated 
from  formulas  discussed  previously.  Sample  tabulations  are  shown  in 
Table  4.  Finally,  the  modified  Bessel  functions  were  calculated  for 

a limited  range  of  variables.  These  results  are  shown  in  Table  5.  ; 

6.  RESULTS  AND  CONCLUSIONS.  These  series  expansions  in  terms 
of  beta  functions  and  incomplete  beta  functions  were  derived  in  order 
to  check  the  accuracy  of  our  Bessel  function  subroutine  (11)  (12)  with 
independent  calculations.  The  error  analysis  of  our  subroutine  by 
theoretical  methods  would  be  very  difficult,  especially  for  the  section 
Involving  continued  fractions.  Hence  computational  efficiency  is  a 
secondary  consideration  for  the  new  series  expansions.  Addressing  the 
comment  of  the  reviewer*,  we  believe  our  algorithm  for  the  Incomplete 

3 

beta  function  is  as  efficient  as  the  continued  fractions  of  Begun  (6) 
when  X is  large,  as  Eq  (54)  can  then  be  used  without  the  remainder. 

We  have  not  made  any  specific  comparison  for  small  values  of  x. 

Since  factorial  series  are  a method  of  summing  certain  asymptotic 
series,  they  are  most  effective  for  large  and  moderately  large  values 
of  the  argument.  The  convergence  is  slow  when  x is  small,  so  that 

an  excessive  number  of  terms  is  required.  Round-off  error  may  occur  in  j 

the  coefficients  b^  and  in  summing  the  series.  Alternate  methods  of  1 


calculating  Bessel  functions,  such  as  quadratures,  are  required  when 
X is  small.  Subroutines  used  for  checking  should  not  use  continued 
fractions  or  other  procedures  used  in  the  subroutine,  to  insure  the 
calculations  are  in  fact  independent. 


* See  ACKNOWLEDGMENTS 
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T4BLE  4.  INCO^'PLETE  AND  COMPLETE  BETA  FUNCTIONS 
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The  generalized  factorial  series  for  and  K^(x)  yield  accurate 

numerical  values  when  x is  only  moderately  large  and  the  Hankel 
as5nnptotic  series  is  not  sufficiently  accurate.  On  the  other  hand,  the 
new  series  for  and  Ij^(x)  have  the  same  range  of  accuracy  as  the 

Hankel  asymptotic  series,  and  do  not  offer  any  computational  advantage. 
This  is  probably  due  to  the  apparent  divergence  of  the  series  for  the 
partial  sums  of  b^. 

The  generalized  factorial  series  for  K^(x)  and  Kj^(x)  are  being 
extended  to  the  complex  plane.  Programming  of  generalized  factorial 
series  for  the  ordinary  Bessel  functions  is  in  progress.  Alternate 
methods  of  calculating  Ijj(x)  are  also  being  considered,  as  the  results 
obtained  in  this  paper  fell  short  of  our  expectations. 

The  modified  Stirling  numbers  as  defined  in  this  paper  are  more 
useful  for  computations  involving  Wasaw’s  algorithm  than  the  original 
Stirling  numbers,  as  problems  arising  from  very  large  numbers  and 
overrun  of  the  computer  registers  are  entirely  avoided.  The  method  of 
scaling  employed  in  this  paper,  which  merely  involves  the  omission  of 
gamma  functions  in  the  definition  of  Stirling  numbers  is  more  effective 
than  the  method  of  scaling  used  in  previous  paper  by  the  authors  (13) . 
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APPROXIMATION  OF  IRREGULAR  SURFACES 


Helmut  M.  Sassenfeld 

US  Anny  TRADOC  Systems  Analysis  Activity 
White  Sands  Missile  Range,  New  Mexico  88002 

ABS1RACT 

A method  is  outlined  to  approximate  irregular  (empirical)  surfaces 
z = f(x,y)  in  the  real  domain,  provided  z is  single-valued  when  the 
inverse  functions  x and  y may  be  multi-valued,  using  sets  of 
quadratic  and  linear  expressions  for  constant  and  variable  z values 
and/or  superposition  of  simple  analytically  described  surfaces.  The 
parameters  of  the  approximation  elements  are  geometrically  identifiable 
and  so  easily  obtained  from  graphs  or  numerical  values  of  z.  A pro- 
cedure is  given  to  interpolate  between  the  approximation  curves  that 
applies  a simplified  gradient  and  insures  unambiguous  z approximations 
in  the  given  domain.  The  interpolation  algorithm  further  contains  an 
associative  look-up  that  considerably  reduces  the  computational  effort 
for  highly  detailed  approximations  for  points  in  close  vicinity.  The 
method  achieves  fairly  good  approximations  with  much  fewer  parameters 
than  polynomial  approximations  and/or  grid  point  data  sets.  The 
restriction  to  easily  treatable  approximation  elements  makes  for  com- 
putational effectiveness  and  analytic  simplicity.  The  method  is  being 
used  to  approximate  terrain  for  simulation;  it  can  effectively  be 
applied  to  approximate  other  functions  f(x,y)  especially  those  that 
require  large  amounts  of  data  in  digitized  form.  Two  examples  of 
applications  to  terrain  are  given. 
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INTRODUCTION 


I 


The  methods  to  approximate  irregular  surfaces  described  "here 
resulted  from  an  attempt  to  approximate  terrain  by.zTl iniited  num- 
ber of  analytical  expressions  in  order  to  overcome 'tne  need  for 
huge  amounts  of  digital  terrain  dati  for  combat  simulation,  espe- 
cially since  data  of  that  kind  were  only  available'for  a few  areas. 

The  approximation  by.  analytical  functions  also  rec^uces  the  effort 
of  line  of  sight  computation,  i.e.,  determination  v/hether  direct 
line  (of  sight)  between  two  points  on  the  surface  is  obstructed  by 
some  part  of  the  surface  itself.  The  process  of  line  of  sight 
computation  does  use  a significant  amount  of  computing  resources 
for  several  combat  models. 

The  methods  presented  here  are  de  facto  empirical,  i.e.,  even 
in  principal  any  desired  accuracy  of  approximation  can  be  obtained 
by  reapplying  the  process;  the  process  can  become  more  cumbersome 
after  applying  a limited  set  of  approximation  elements  and  then  to 
find  more  approximation  elements  to  obtain  another  order  of  magni- 
tude of  accuracy.  This  does  not  invalidate  tbe  objective,  v/hich 
is  to  approximate  the  gross  structure  of  an  irregular  surface  by 
simple  means  and  not  to  achieve  utmost  accuracy. 

Two  distinct  approaches  to  approximate  surfaces  are  presented 
here,  the  contour  line  approach  (Ref  1)  and  the  superposition  of 
elementary  surfaces.  The  first  method  is  more  effective  for 
“rugged"  surfaces  with  few  macro  structure  elements.  Tne  second 
has  the  advantage  of  greater  simplicity  and  also  easiness  of 
extending  the  approximated  surface  without  recomputing  the  approx- 
imation elements  and  still  achieve  continuity  beyond  the  original 
area  of  consideration. 

The  two  methods  are  described  and  examples  given.  The  process 
of  1 ine  of  sight  determination  is  also  outlined  briefly. since  it 
is  of  such  basic  interest  to  the  current  application  of  the  method. 

CONTOUR  LINE  APPROXIMATION  . 

We  assume  that  'the  surface  2 = F(x,y)  to  be  approximated  is 
continuous  and  single  valued.  The  inverse  functions  x = g{x,2) 
and  y = g(x,2)  may  be  multivalued.  Then  the  surface  can  be  de- 
picted by  a set  of  curves  Zv  ~ = const  (contour  lines).  The 

2 may  be  sets  of  several  distinct  curves  for  a given  ^ . Curves 
for  different  2^  do  not  intersect.  A set  of  such  contb’ur  lines 
is  depicted  in  Fig,  1 . 

i 

i 
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Fig.  1 


For  ease  of  description  we  will  refer  sometimes  to  z as  eleva- 
tion. i 

We  approximate  the  contour  lines  by  sets  of  quadratic  or  linear 
expressions.  Frequently  contour  lines  have  pronounced  indentations 
or  bulges.  Such  curves  can  be  approximated  by  defining  pseudo  con- 
tour lines.  Pseudo  contour  lines  are  analytically  simple  approxi- 
mation curves  and  that  have  elevation  differentials  for  certain 
points  assigned  along  their  arc.  This  avoids  the  use  of  higher 
order  approximation  elements  that  result  in  lengthy  expressions. 


I 


Approxir.ation  Elements 


We  will  restrict  the  choice  of  curves  to  ellipses,  parabolas 
and  polygons  as  elements  for  the  approxir.ation  of  contour  lines 
and  pseudo  contour  lines.  The  main  reason  for  this  restriction  is 
the  ease  in  obtaining  the  relevant  parameters  from  given  contour 
lines  and  also  to  allov.i  explicit  line  of  sight  computations  which 
are  important  for  our  application  of  the  method  to  combat  modeling. 


Ellipses.  An  ellipse  is  defined  by  five  parameters,  but  for  prac- 
tical purposes  we  will  use  six  parameters  (two  are  not  independent), 
i.e.,  the  coordinate  vector  to  its  center  and  the  four  parameters 
of  its  transformation  matrix.  Given  coordinate  vector  s?  of  the 

center  in  the  k coordinate  system,  then  the  transformation 
0* 


(1) 


= (x\y') 


(s° 


)• 


a"^cosx°x^  b"^cosx°y^ 
a”^cosy°x^  b~^cosy°y^ 


reduces  the  equation  of  the  ellipse  to  a unit  circle 
(2)  s^  • s^  = 1. 

It  is  also  true  that  a point  s^  is  inside  the  ellipse  if  the 
scalar  product  is  less  than  one  and  outside  if  the  scalar  product 
is  larger  than  one,  a simple  criterion  that  will  be  used  later. 

Parabola.  -We  will  use  parabolas  (Fig.  3)  given  by  three  points 
and  defined  by  the  equation 


(3)  S = Ut^  + Vt  + S2  U = (S]  - 2S2  + S3)/2 


The  vectors  u and  v can  be  precomputed  from  the  s^.  The  dimen- 
sionless parameter  t is  -1,  0,  and  +1  for  sj , S2,  and  S3  respect- 
ively. Note  that  there  are  actually  three  different  parabolas 
possible  through  three  points  and  the  labeling  of  s determines  the 
"mid-point"  and  the  desired  parabola.  '' 

Polygons.  Polygons  are  represented  by  sets  of  straight  lines 
according  to  the  equations  (Fig.  3). 

(4)  s = s + (s  , - s )t  V = 1,  2,  3 ...n(mod  n) 

V V + 1 V 

The  points  should  be  labeled  cylically.  For  the  sides  (or 
points  on  the  sides)  of  the  polygons  is  0 <t  <1,  a criterion  that 
will  be  used  later  to  decide  whether  a point  Ts  inside  the  polygon. 

Pseudo  Contour  Line  Approximations 

To  approximate  pseudo  contour  lines  we  also  need  to  define  a 
AZ  at  given  points  on  the  curves  defined  above.  As  can  be  seen 
on  Fig.  2.  The  fact  that  we  have  an  "indentation"  or  a "bulge" 
depends  on  the  sign  of  az.  The  abruptness  of  change  from  a 
"smooth"  contour  line  depends  in  addition  to  the  amount  of  az  on 
the  closeness  of  other  contour  lines.  For  an  ellipse  pseudo  con- 
tour lines  can  be  approximated  for  four  points  at  the  intersections 
witii  the  coordinate  system  by 

(5)  z = z_  + L (g  -g  + Ig  -gOA  /2(ig  I -Igi) 

where  g is  the  coordinate  vector  of  the  point  on  the  circumference 
for  which  the  elevation  is  to  be  established  and  the  g^  are  the 
unit  vectors  of  the  given  points  with  Ay  elevation  differentials. 
The  sums  of  the  scalar  products  with  their  absolute  value  really 
indicate  that  there  are  never  more  than  two  terms  of  the  sums, 
namely  the  two  g^  that  limit  the  quadrant  in  which  g is  located. 
The  interpolation  actually  uses  a Fourier  form  to  avoid  the  use 
of  transcendental  functions. 

Procedure 


To  approximate  a surface  by  elements  as  defined  above  one 
establishes  from  the  contour  lines  the  approximate  center  points 
and  axes  of  ellipses  (for  those  contour  lines  that  resemble 
ellipses)  and  precomputes  the  parameters  of  the  transformation 
matrix.  For  the  other  contour  lines  one  chooses  either  parabolas 


through  three  points  or  polygons  with  convenient  points.  Parabolas 
can  be  chained  to  form  any  variety  of  curves.  With  all  derived 
parameters  precomputed  one  obtains  a set  of  parameters  that  describes 
the  approximating  surface  and  will  be  referred  to  as  the  structure 
set.  To  regenerate  the  approximate  surface  one  proceeds  as  outlined 
in  the  next  chapter  and  the  algorithm. 

Interpolation 

Once  we  have  established  a set  of  approximation  elements  and  their 
parameters  we  can  describe  the  contour  lines  of  a surface  as  shown 
in  Fig,  1.  For  a point  on  the  area  under  consideration  we  have  to 
compute  the  z-  value  (elevation),  but  since  normally  a point  would 
not  fall  on  a contour  line  we  have  to  interpolate  between  adjacent 
contour  lines.  The  correct  interpolation  is  along  the  gradient. 

To  obtain  the  gradient  in  a system  (set  of  approximation  elements) 
is  equivalent  of  using  an  irregular  curved  linear  coordinate 
system.  There  are  also  cases  where  there  are  more  than  two 
adjoining  contour  lines  (Fig.  1)  and  the  minimum  distance  must  be 
chosen.  Since  the  gradient  and  especially  its  length  between  a 
point  and  an  approximated  contour  line  cannot  be  computed  explicitly 
we  are  using  the  simplifications  indicated  below. 


For  ellipses  we  use  the  distance  between  the  point  A and  the 
periphery  of  the  ellipse  along  the  straight  line  that  goes  from 
the  center  of  the  point  A (Fig.  4).  Using  the  transformation 
indicated  in  (1)  we  receive  for  the  distance  with 

(6)  (PA-Si)-^-  = Wi  ($^-sJ-T^=w^ 


D2  = (1-IW. l)2-(p^-S.)'(p^-S.) 
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and  if  the  respective  values  of  z -=  const  are  z.  and  z,  this 
interpolates  to  i k 

(7)  z^  = (z.dJ'  + Zj.d'[')/(D."'  + Dj,"')  m = 1 or  2 

For  n;=l  we  have  linear  interpolation  and  for  r>-2  we  have  Pythag- 
orean interpolation  which  saves  the  computation  of  square  roots 
in  the  numeric  process.  For  confocal  ellipses  these  approximations 
can  be  made  more  accurate,  see  reference  1. 


For  parabolas  the  distance  from  a point  to  the  parabola  is 
approximated  as  follows.  One  uses  a center  point  that,  for  all 
practical  cases,  is  inside  the  parabola  and  reasonably  far  away 
from  the  parabola  itself.  Such  a point  is_^halfwa^  between  the 
intersections  of  lines  orthcconal  to  S5  - S2  and  Sj  - S2  through 
the  points  P3  and  pj  respectively  and  the  bisector  of  the  angle 
Pi  Pz  P3  shown  in  Fig.  (5). 

-+• 

The  point  P^  defined  as 


18)  p^  = + +^'52)  (IS12I  + 15320/2(1  + Sj2-  ^32) 


s =s-s.s  =s  /is  I 

VV  V V uv  pv  UV 


For  the  distance  between  point  A and  the  parabola  we  obtain  the 
approximation 


(9)  D 


Ap 


‘a  ^cA'^cA 


cA 


Pc  - Pa 


= [2<vxu>t^  + <(S2-P^)  X Li>]/y 


[i  D-6]/t 


6 = <p^^  X v> 


^cA 


<cxc>  = ac  -ac  , t?sl 
X y X A 


if  ti  is  larger  than  one  the  point  A lies  outside  the  validly 
defined  portion  of  the  parabola.  There  are  formulas  for  these 
pathological  cases,  reference  2. 
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T^e  point  A is  outside  the  defined  area  of2the  parabola  if 
t^sl  and  T^^l-  The  point  A is  inside  if  t^^l  and  1<T^- 

Algorithm  for  surface  generation  and  interpolation. 

Given  a set  of  approximation  elements  e^  whose  components  are 
identification  as  to  type  (ellipse,  parabola  or  polygon),  the  ele- 
vation Zv,  indicators  of  pseudo  or  non-pseudo  contour  line,  and  all 
parameters  required  to  describe  the  approximation  element.  It  is 
practical  to  order  the  set  of  e,-  into  a matrix  as  follows  begin- 
ning with  the  lowest  (largest  area)  go  the  next  higher  z^  con- 
tained in  the  same  area  and  forth  to  the  highest  contour  line 
contained  in  the  first  area.  Then  build  the  second  group  from 
the  lowest  z^  where  there  were  more  than  one  contained  in  a contour 
line.  Continue  until  all  sets  are  used.  For  concave  portions 
of  the  surfaces  one  should  start  from  the  contour  line  that  covers 
the  largest  area  and  go  up  and  down  similar  to  the  above.  The 
order  in  which  the  groups  (sets  of  ei ) are  counted  is  irrelevant. 
The  ordering  in  the  above  fashion  is  computationally  expedient. 

To  obtain  the  z coordinate  for  a given  point  A one  proceeds  as 
follows.  Check  the  e^  (starting  with  the  first  group)  for  the 
element  that  contains  the  point  A,  using  the  respective  formulas 
above.  If  such  an  ep  is  found  one  proceeds  to  the  next  higher 
Bp+i  of  the  same  group  until  one  finds  the  eo  that  do  not  contain 
it.  One  interpolates  then  between  Bq  and  eo-i.  If  the  next 
higher  elevations  are  in  another  group  that  contains  the  point, 
one  follows  that  group. 

When  large  numbers  of  points  are  needed  that  are  close  together 
a simple  adaptive  procedure  can  be  used.  The  highest  element 
that  contained  the  previous  point  is  saved  and  the  next  point  is 
tested  against  it  first.  If  the  new  point  satisfies  the  test  no 
further  search  i^s  needed.  If  the  new  point  does  not  satisfy  the 
same  test  with  e^ , one  proceeds  to  the  next  lower  one  and  starts 
a full  search  if  necessary.  If  the  consecutive  points  to  be 
analyzed  are  close  together  the  searches  are  very  short  therefore 
saving  considerable  computer  time.  (See  also  Ref.  1) 
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AFFkOXIKATIC:.  EV  SlFEFACt  SJPLR^OSFT IDN 

In  this  c^proach  lo  approxir.ite  e given  surface,  a set  of  sur- 
faces is  s'jperir.pcsed  in  such  a way  that  for  any  given  point  P(x.y) 
the  highest  of  or.e  or  more  overlapping  surfaces  defines  the  result 
surface.  This  results  in  a unique  surface  as  long  as  the  elenent 
surfaces  are  convex  with  respect  to  the  base  plane  2=0.  If  con- 
cave surface  eler.ents  are  used  a limitation  with  respect  to  z must 
be  civen  such  that  the  "ends"  do  not  protrude  (Fig.  7). 


I- i g.  7 


In  principle,  any  kind  of  analytically  describable  surface  can  be 
used  as  an  approximation  element,  however,  as  in  the  case  of  contour 
line  approximation  we  will  restrict  ourselves  to  quadratic  or  poly- 
quadratic^  surfaces.  First,  the  general  shape  of  such  surfaces  if 
very  clear  and  therefore  more  suitable  for  empirical  composition  of 
a surface,  and  second  using  quadratic  surfaces  results  in  explicit 
formulas  for  line  of  sight  determination. 


Given  a set  of  surfaces 
is  defined  as 


2^  = f^(x,y)  then  the  resulting  surface 


(10)  Maxf^(x,y)  = Z(x,y) 
with  f,^(x,y)  = 2* 


i = 1,  2,  3...n 

kCi 


-Folyquadratic  in  this  case  is  quadratic. 
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for  ori:-se  y!  tf.it  ore  contsye  crc  r.ove  tr.e  Mmt  o’.  All 
vi'ues  cf  X,  y and  t r'ust  be  ree']  ve'iuet.  Wr.er.ever  an  ''  f-(x,y) 
produces  a corrip'iex  z value  for  a civen  point  {y.r  ,y^)  the  value 
is  to  be  excludec  frorr.  the  .".axirur:  searcn  (10). 

To  apprpxi.r.ate  a given  surface  by  this  rethod  one  can  structure 
tne  search  for  the  proper  set  of  f^-(x,y)  iteratively. 

if  z = r(x,y)  is  the  given  surface  to  be  approxiriated  then 

;li/  Z'  ' = r.aX  C F f X . V 1 - 


- ''v-^-ll 

i ■ i 


F(>:,y) 


\y,y)  = .J'.,  M?xf^.  ^'"^(x.y)  + 


Z^°'=0 


where  Cfp(x,y)  is  the  rer.aining  error  after  the  ir.-th  approximation. 
This  method  is  conve-oent  in  the  general  case  considering  a bread 
choice  of  f.:(x,y)  and  that  F(x,y)  is  real,  one-valued  and  contin- 
uous over  the  considered  area.  In  practice,  however,  with  succes- 
sive iterations,  the  structure  of  the  residue  surface  tends  to  gain 
complexity  due  to  the  repeated  diffe''encinc  process.  Therefore, 
it  is  generally  better  to  change  the  first  and  second  set  off.j(x,y) 
rayh^r  than  trying  to  attain  higher  accuracy  by  adding  more  sets  of 
f.(’^)(x,y}  which  will  also  increase  the  computational  effort.  This 
is  com-ensurate  with  the  objective  of  the  approximation  method  to 
achieve  a reasonable  approximation  with  relatively  few  parameters, 
ouP  riot  CD  acmeve  uemose  accuraev'  wtch  c.he  aoG'tC'ion  of  a larce 
num.ber  of  base  daca. 


As  stated  be’fore,  any  non-singular  continuous  function  can  be 
used  as  an  approxir.ati on  ele.-ent.  For  practical  reasons  we 
will  rescrict  the  cncice  to  quadratic  (polycuadretic)  surfaces, 
we  will  use  su:-face  elements  that  are  expressed  by  the  following 
equacior, 

(12)  z>T/£~  r yVb-  + (z/c)'  = 1 r = 1 or  2 

~r.e  r':us  sig''s  and  ;=2  represent  an  elliosoic,  : = 1 an  elliptical 
oaraoolC'id.  Tne  minus  signs  describe  similar  surfaces  but  the}' 
are  concave  with  respect  to  tne  x-y  plane,  see  Fig.  9. 

The  basic  form  (12)  is  achieved  by  a coorcinace  transforimation 
similar  to  (1),  however  three  dimensionally.  V.'e  will  assume  that 
the  coordinate  systems  of  the  approxinati or,  eles'ents  have  their 


:-cxes  parallel  to  the  coordinate  systerr.,  hence  the  approx itT,ate 
directional  cosines  are  zero.  We  can  therefore  handle  the  approxi- 
iTiation  elenients  with  the  same  two-dimensional  vector  s^-  as  in 
equation  (1).  ’ 

(13)  s’  = (x.y)  = (s*^  - sp-T^ 

and  the  approximation  surface  is  defined  as 

(14)  ZCx'^.y'")  = Max  f^(x,y)  = Maxc^.(lis’  ' s’) 

To  utilize  the  approximation  elements  the  relevant  parameters 
s?  and  the  coefficients  of  the  matrix  t.  have  to  be  determined. 
Since  one  knows  only  the  surface  that  needs  approximiating  these 
parameters  have  to  be  derived  from  the  given  set  of  contour  lines. 
The  relationships  for  maximum  elevation  and  minimum  elevation 
to  be  considered  by  an  element  are  evident  from  Fig.  8 and  Fig.  9. 
With 


Fic.  8 


equation  (12)  this  leads  to 

(15)  a2  = ii^K''/(H''  - h^)  P = 1 or  2 

The  minus  sign  applies  to  concave  surface  elements.  The  same 
formula  applies  for  b in  the  y''-z  plane.  Note  that  the  plane  depicted 
in  Fig.  8 and  Fig.  9 is  the  transformed  roordinate  system  going 
through  the  center  of  the  ellipsoid  or  otner  approximation  element. 
When  convex  and  concave  surface  elements  are  used  it  is  important  to 
note  that  curvature  of  the  concave  element  must  be  less  than  all 
adjacent  ones,  see  also  Fig.  6.  A sufficient  criterion  for  this  is 
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< 


Kin(a^/e^ ,b^/b^) 

V V \»  V 


for  all  adjacent  surfaces(index  v)  and  the  "axes”  of  the  concave 
surface  (index  c). 

This  criterion  is  cunbersome  to  use  in  its  exactness.  For 
practical  purposes  it  is  usually  quite  obvious  whether  the  concave 
i surface  is  flatter  than  the  adjoining  surfaces. 

; Polvoua dratic  Surfaces 

■( 

If  in  equation  (12)  p=l/2  the  approximation  surface  is 
(16)  Z(x°,  y°)  = Maxf  . (x.y)  = Max[L  . + (!-?’ •$’)’'] 

i ’ . . i 

2 < /.  V-V  < 1 


The  profile  of  such  surface  in  the  x-z  plane  is  shown  in  Fig.  10. 
For  Si-S''  <0  we  have  a partially 
concave  surface  and  its  extent  is  not 
easily  evident  when  used  in  actual 
approximations.'  The  larger  >,  the 
steeper  the  curve.  For  practical 
purposes  it  is  best  to  restrict  the 
choice  to  >.=2  and  ;,=4.  As  Fig.  10 
shows  one  can  use  L to  move  the  point 
of  zero  curvature  further  up,  as  long 
as  L<h. 

The  relation  between  a and  a is  given 
by  equation  (17). 

Fio.  10 


^We  are  talking  here  about  obvious  shape  without  having  done  a 
point  by  point  computation  of  course  any  point  of  that  surface 


be  computed  exactly  if  desired. 
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(17)  a2 


c = H-L 


\ 

j 


= i2(H-L)P/[(H-L)P  - (h-L)P] 


The  formula  for  is  identical  to  (17)  with  the  respective  H 
and  b. 


LINE  OF  SIGHT  DETERMINATION 

For  applications  of  surface  approximations  to  terrain  in  con- 
junction with  combat  models  it  is  of  primary  importance  to  know 
whether  the  straight  line  between  two  points  on  the  surface  is 
obstructed  (intersected)  by  other  parts  of  the  surface.  Because 
that  computation  has  to  be  repeated  numerous  times  it  is  imperative 
that  the  computation  be  simple  and  short. 

For  contour  line  approximation  the  algorithm  is  briefly  as 
follows.*  The  parametric  representation  of  the  line  between  two 
points  with  the  position  vectors  and  Pg  is 

P = ’’a  * (Pb  - '’a>^- 

This  equation  is  transformed  according  to  equation  (1)  and  then 
T is  determined  by  using  equation  (2)  and  similarily  by  equations 
(3)  or  (4)  with  respect  to  other  approximation  elements.  If  Tj  or 
T2  of  the  intersection  is  between  0<T<1  and  z = + (zb-z;^)Ti^2 

is  less  than  the  elevation  of  the  intersecting  contour  line,  the’ 
line  of  sight  is  obstructed.  If  none  of  the  approximation  elements 
(contour  lines)  is  obstructing,  line  of  sight  exists.  This  involves 
a search  algorithm.  It  can  be  shortened  for  numerical  purposes  by 
starting  the  search  for  an  adjacent  point  by  checking  results 
(obstruction  element)  of  the  last  point  first,  since  it  is  likely  that 
the  same  contour  obstructs  again  (Ref  1). 


Fig. 11 

♦Detailed  explanation  is  given  in  Ref  2. 
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For  superpositioned  surface  the  line  of  sight  can  be  determined 
as  follows  using  the  ellipsoid  as  an  example 

With  p^  = (x^.y^.z^)  and  p = p^  + (Pg-P;^)!  and  s^  = (s°.s°,s°) 

one  obtains 


q = (Hi-  (P-y  . (E)  = 


for  Tj  and  Tj  Tj  2 

D2  = a(l-Y)  + e?,  a = q|^ 


Ti  0 

0 

0 0 C"^ 

= (+D-3)/a 

• ^ ■ ^BA 


r - r = ^ 


Y = qi 


The  conditions  for  line  of  sight  are: 
D2  0 

D2  > 0 either  T,  or  both  0<T<1 


No  Obstruction 
Obstruction 


For  elliptical  paraboloids  the  same  conditions  apply,  however  a,  B 
and  Y are  different  with  respect  to  the  contributing  z-  components. 

For  polyquadratic  surfaces  enveloping  elliptical  paraboloids  are 
used  as  a first  approximation,  which  is  then  refined,  if  necessary, 
for  certain  conditions.  A detaileu  discussion  is  given  in  Ref  2; 
it  goes  beyond  the  scope  of  this  paper. 

RESULTS 

The  methods  of  surface  approximation  have  been  applied  to  some 
real  terrain.  Fig.  12  shows  a graph  of  contour  line  approximations. 

A comparison  of  digital  data  and  various  degrees  of  approximation 
by  sur.ace  superposition  was  made  for  a terrain.  The  results  are 
shown  in  Table  1 below.  Note  that  the  given  values  for  the  mean 


Approx.  Elements 


Mean 


Std.  Deviation 


6 

2.5% 

12 

0.5% 

16 

0.1% 

19 

0.2% 

5.7% 

4.1% 

3.7% 

3.6% 


•••  h Terrain  Comparison 

4r9  taken  from  the  difference  between  approxi- 
i«'a.  and  related  to  the  maximum  elevation.  The 
' » it  ’ 4ntfi  of  the  data  that  the  digital 


L 


f 

f 

t 

r 

(■ 

I 

\ 

\ 


representation  requires  to  achieve  the  deviation  of  only  3.6%  in 
the  above  case. 

Figure  13  and  Figure  14  show  a relief  and  a plot  of  contour 
lines  for  the  Steinbach  Terrain,  this  case  was  also  used  in  the 
comparison  shown  in  Table  I. 

Figure  15  shows  a line  of  sight  map  of  the  same  terrain,  i.e., 
all  blank  parts  of  the  terrain  are  visible  from  the  observation 
point(*).  The  digits  indicate  elevation  intervals  in  multiples  of 
50  meters. 
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Bifurcation  at  Multiple  Eigenvalues. 


Suppose  we  want  to  investigate  the  bifurcation  of  solutions  of  a 
system  of  equations 


G(X,u)  - 0 (1) 

in  the  neighborhood  of  a Known  solution  • Assume 

L “GCXju)  isa  Fredholm  operator  of  index  0 and  that 
o u o o 

n * dim  Ker  ^ X*  Then  by  the  Lyapounov  - Schmidt  method^  we 

can  reduce  the  bifurcation  problem  to  that  of  solving  a system  of 
algebraic  equations 

Fj^{X,  z^,...,z^)  -=0  i = 1,  • • . ,n  . (2) 

If  G(X,u)  is  an  analytic  operator  then  the  are  also  analytic. 

In  practice  the  con^jutation  of  even  the  lowest  order  terms  of  the 
F^  is  a non-trivial  matter,  especially  if  (l)  is  a particiilarly  complicated 
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aystOD  of  equations,  as  In  elasticity  or  fluid  aechanics.  Moreover,  given 
n equations  in  n unknowns  virtually  anything  can  happen  in  the  way  of 
solution  sets  and  their  stabilities,  and  the  algebraic  problea  can  be 
quite  conplex.  There  is  a natural  assuaq'tlon  we  can  sake  in  such  prob- 
lems which  not  only  allows  us  to  bypass  the  nvaerioal  difficulties 
inherent  in  the  l^apounov  - Schsddt  procedure  but  which  also  provides 
ua  with  a general  approach  to  bifurcatian  at  multiple  eigenvalues  and 
with  a way  of  classifying  bifurcation  points  at  multiple  eigenvalues.  1 
vill  assme  that  the  mapping  G is  covariant  with  respect  to  a trans- 
formatioD  group  > . ITiat  is,  let  be  a representation  of  > ■ 

T • T T 

and  assisse  that 


0(X,u)  - 0(X,TgU) 


(3) 


'nils  is  a natural  assisaption  in  physical  theories  and  is  a mathematical 
expression  of  the  axion  that  the  equations  of  mathematical  physics  be 
independent  of  the  observer. 

Frcn  (3)  it  follows  that 


Tg  G^(X,u)  = G^(X,  TgU)Tg 


so  if  is  a solution  which  happens  to  be  invariant  under  the  entire 


group  , T u = u , we  have 
® go  o' 
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4 


T L - L T 
go  o g 


where  L • G (X  ,u  ) . Therefore  H - ker  L Is  invariant  under 
o u o o o o 

T . and  T is  a finite  diaensional  repreaentation  of  J . Let 

g gl  "jj 

■e  write  the  bifurcation  equations  (2)  in  the  fora  F(X,v)  ■ 0 where 

v(N  and  r:CxR->N  . 

O O o 

nteorea  1.  If  O(X.u)  is  covariant  and  T u ■ u then  so  is 
' I — — ' — ■ — — A o o ~~  — 

f-  T^f(x,v)  - r(X,T^v)  . (See  (X)) 

Let  as  now  expand  F in  a power  series  in  v: 


r(X,v)  • A(X)v  ♦ Bp(X,v,<)  ♦ Bj(X,v,v,v) 


Ti>en  we  Bust  have 


T A(X)  - A(X)T 
g g 


Tg  B^(X.v.w)  - B^(X.TgV,T^w) 


Throughout  this  talk  1 will  make  the  assumption 


H : N is  irreducible  under  T 
2 o g 


From  (U)  and  it  follows  (by  Schur's  lensna  [3])  that 


(M 


(5) 


A(X)  = o(X)l 
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Trtiere  I is  the  identity.  Suppose  for  convenience  X =0  and 

o 

a(X)  * X + Cg  X^ +. . . . Then  by  various  scaling  arguments^^^  the 
bifurcation  problem  can  be  reduced  to  an  analysis  of  the  equations 


\(w) 


where  is  the  first  nonvanishing  term  in  F , homogeneous  of  degree 

k . Equations  (6)  are  called  the  reduced  bifurcation  equations.  It  can 
be  shown  ((5l  Theorem  ’’.2)  that  the  stability  of  the  bifurcating  solutions 
can  be  determined  to  lowest  order  from  an  analysis  of  the  Jacobian  of 
(6)  at  a solution. 

T^e  group  theoretic  approach,  then,  is  to  compute  the  lowest 
nonvanishing  term  , find  all  solutions  of  (6),  and  determine  their 
stability  in  the  neighborhood  of  the  branch  point. 
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Today  I will  show  how  the  above  program  can  be  carried  out  in  a 
specific  case,  namely  when  the  group  in  question  is  0(3)  t the  group 
of  rotations  of  the  sphere.  There  are  a nimober  of  classical  bifurcation 
problems  in  which  this  sitxiation  arises,  for  example  the  buckling  of 
a perfectly  uniform  spherical  shell,  or  the  onset  of  convection  in  a 
spherical  mass.  The  latter  problem  and  its  possible  connection  with 
convection  in  the  earth's  mantle  and  plate  tectonics  has  been  discussed 
recently  by  F.  Busse.^^^ 

Our  first  task  is  to  construct  the  2nd  or  3rd  order  covariant  mappings 

B(v,w)  or  B(u,v,w)  . It  is  in^jortant  to  keep  in  mind  that  these  are 

completely  symmetric  mappings:  B(v,w)  = b(w,v)  . 

The  irreducible  reps  of  S0(3)  a.re  of  dimension  2t+l  t •t=0,l,... 

and  are  denoted  by  . To  get  quadratic  mappings  we  consider  the 

tensor  product  acting  on  N ® N . Let  us  recall  the  Clebsch- 

(31 

Gordon  series  , p.  233) 

D'*'  ® D'^  = 0^^  e ©. . .©  D°  (7) 

■niis  means  that  N ® N decomposes  into  a direct  sxmi  of  invariant  irre- 
ducible subspaces  V^: 

N ® N = ^ e. . .©  V*'  ©. . .©  V° 

and  precisely  one  of  these  subspaces,  namely  v"^  , transforms  like  . 
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Since  we  require 


B(D'^v  , D'^r)  = B(v,w) 


this  is  precisely  the  one  we  want. 

Now  in  the  above  deconqsosition  contains  symmetric  tensors, 

anti-synmetric  tensors,  and  so  forth.  Accordingly  will 

be  symmetric  iff  -t  is  even  . Therefore 


for  odd  t there  is  no  quadratic  term 


and  we  must  go  to  cubic  terms. 

Let's  consider  the  case  of  even  -t  . I will  construct  the  mapping 
B using  Lie  algebra  methods  which  are  well  known  in  the  theory  of 
angular  momentum  coupling  in  quantum  mechanics.  Since  B can  be  assumed 
to  be  conpletely  symmetric  we  can  work  with  polynomials  and  write 


^i^*l ^n^  “ ^ijk  \ 


We  begin  with  the  infinitesimal  generators  of  the  rotation  groi^ 


These  satisfy  the  commutation  relations 
f’^i’  " 'idk  '^k  ’ 
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i.;  the  con5)letely  anti-symmetric  tensor.  Now  put 


where 


e 


= ± + i Jj_ 


These  operators  satisfy  the  commutation  relations 

[J"^,  J"3  = 2 , [J^,  J^]  = ± . (8) 

Ihe  operators  are  the  familiar  ladder  operators  of  quantum  mechanics. 

Let  N be  an  irreducible  (real)  invariant  vector  space  which  transforms 
like  under  S0(3)  • Then  the  ccanplexified  space  N + iN  has  a 
basis  f^  such  that  (see  [3],  Chapter  7) 


3 ni 


m 


f 

m 


± m 


— ft  ^ 

^±ffl  m ± 1 


f 

m 


(-1)^"“  f 


(9) 


where 


-l<m<t  and  P =^(-1-111)  (t-Hn+l) 
— — m 


This  basis  may  be  constructed  using  the  commutation  relations 
(8) . We  now  represent  N as  the  vector  space  of  lineeu:  polynomials  in 
z ,...,z  , where  the  variables  z act  as  the  f in  (9)  . Denote 


by  K[z_^, . . . ,z^]  the  ring  of  polynomials  in  the  independent  variables 
z_^,...z^  . K is  isomorphic  to  the  algebra  of  symmetric  tensors  over 
N . (There  is  a natirral  correspondence  between  tensors  over  N and 
multilinear  transformations  of  N into  itself;  See  [U])  . We  now 
extend  the  operators  and  to  be  derivations  over  K : 


J(af  + Sg)  = a Jf  + p J 

o 

J(fg)  = f J + (Jf)^ 
& & 


(10) 


where  f , g 6K  and  a and  6 are  scalars.  It  is  natural  to  extend 
the  J's  in  this  way  since  they  are  Lie  derivatives. 

I apologize  if  I have  lost  the  reader  with  what  may  seem  to  be 

meaningless  algebraic  abstractions;  but  now,  using  (9)  and  (lo),  I am 

ready  to  present  a sin^jle  algorithm  for  the  construction  of  the  polynomials 

Fm(z  ^,...z^)  . The  procedure  should  be  familiar  to  those  readers  who 

have  studied  angular  momentum  coupling  in  quantum  mechanics.  We  require 

the  F to  transform  in  the  same  way  as  the  z : 
m m 


J_  F = mF 
3 m m 


J F = B F 
± m ±m  m ± 1 


Consider  first  the  quadratic  terms  in  F . (The  linear  term  of  F 

m ' m 

is  a scalar  times  z ; since  the  representation  is  irreducible  the 
m 

linear  term  must  be  a scaleir  multiple  of  the  identity. ) For  quadratic 
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Furthermore  this  condition  gives  us  a set  of 

linear  equations  for  the  coefficients  Q , • For  example,  in 

o t/2 

the  case  I,  = 2 we  have 


Fg  = a ^2  Zq  + b Zi 


*^+^2  " ^ ^2  ^1  ^1  ^2  ^3 


= (a  Pq  + 2b  ^3^)23^22  ^ ° ’ 


a P + 2b  P = 0 . 

O 1 
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Once  F is  known  we  get  F,  ^ from 


and  so  forth.  In  this  way  we  can  construct  all  the  F 's  . 

m 

This  procedure  extends  immediately  to  higher  order  terms.  For 
example  to  get  3rd  order  terms  we  write 


F = L A.  z . z . z, 
^ i+j+k=t  ijk  1 J k 


and  apply 


= 0 


to  get  a linear  system  of  equations  for  the  A.  . For  1 = 1 there 
is  only  one  solution  but  for  t=3  there  are  two  independent  solutions. 
In  fact,  the  condition  F^  = 0 leads  to  five  equations  in  seven 
unknowns. 

For  the  quadratic  case  the  general  mapping  is  given  in  terms  of  the 
Clebsch-Gordon  coefficients 


F (z  .,...z.)  - Z 't>m)z  z 

“ ^ m ^ “1  “2 


“l-^ 


or  the  Wigner  3-J  coefficients 


„ / I I I \ 

= (-1)“  E ( ) 

» ^ Vm^  -m/ 


m^-Hi^=n 


z z 

m^  mg 


Much  is  knovm  about  the  3 ■ J symbols,  since  they  are  of  prime  in^portance 
in  atomic  spectroscopy.  Here  is  a generating  function  for  them 
(13],  p.  261) 


III.  special  Results. 


Let  me  discuss  some  special  results  which  can  he  obtained  for 


low  ^ = 1,2,3,^  . 


1.  i,  = l (vector  fields).  The  reduced  bifurcation  equations  are 


X = a z^(z^  - 2 z^  z_^) 


X z = a z (z  - 2 z z ^ ) 
o o o 1-1 


For  real  solutions  we  require  z^  = (-1)  z ^ . The  entire  (non-trivial) 
solution  set  is 


zl^2\z/ 


The  parameter  a depends  on  the  specific  physical  problem.  We  must 
have  X/a  > 0 for  bifurcation,  so  the  bifurcation  is  supercritical  if 
a>0  and  subcritical  if  a < 0 . By  rescaling  we  can  assume  X/a=l  . 
Then  the  orbit  of  solutions  is 


z = cos  6 , z = e^  , z 

o 1 ^ -1 


sin  e -i  cp 


These  solutions  axe  all  axisymmetric.  They  are  orbitally  stable  when 
they  appear  supercritically  and  unstable  when  they  appear  below  criticality. 
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Again  we  get  only  axisymmetrlc  solutions  (Busse  [1])  . Ibis  time 
the  branch  Is  transcrltlcal.  The  eigenvalues  of  the  Jacobian  of  the 
reduced  equations  are 

3,0,-l,0,3 


The  two  zeroes  eure  a consequence  of  the  rotational  invariance  of  the 
eqxiation  and  the  fact  that  the  orbit  is  two  dimensional.  There  is  one 


unstable  mode  subcritlcally  and  two  unstable  modes  si^ercritically.  We 


stable 


therefore  have  in  this  case  the 


stable 


\ins  table 


possibility  of  hard  buckling  (snap- 
through  instability)  . 


unstable 


3.  t=3  • The  interesting  thing  about  this  case  is  that  there  are 

two  independent  covariant  mappings  viz. 


2 z^z_^  + 2 Z2Z_2  - 2 


and 


This  means  the  bifurcation  equations  take  the  form 

X z = A F(z)  + B g(z) 

where  the  parameters  A and  B depend  on  the  external  physical  constants 
of  the  problem.  This  situation  occurs  in  the  B^nard  problem  and  gives 
rise  to  mechanisms  for  pattern  selection{5] , [6]  . 


U.  . (Busse^^^)  There  are  many  solutions  to  the  bifurcation 

equations  in  this  case.  Two  special  ones  are 

1)  axisymmetric  solutions:  z =...=  z,  =0,  z to.  The 

±1  o 

eigenvalues  of  the  Jacobian  are 
inn^OgOlOlO;^^ 

The  axisymmetric  solutions  are  th\is  unstable  on  both  sides 
of  criticality,  with  3 unstable  subcritical  modes. 

2 ) octahedral  symmetry 


This  solution  was  found  by  Busse.  The  eigenvalues  are 


0»  0, 0, 


^ 20  20  20  5 5 

rj  * 'f  * * 


This  solution  thus  has  one  unstable  subcritical  mode. 


Here  are  the  quadratic  polynomials  for  the  case  -t=4: 


i 
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„ 1 1 .32 

Fi  = — z,  z - — z_  z,  + — - — z„ 

^ ° ^ ^ aynr  2 


.,1  3 1 

F_  = — Z|  z T — Zo  Z + Z„  Z- 

3 72  **  -1  2 75  3 ° yilT  ^ ^ 


_ - _ 1 n 3 2 

F„  = z.  z Zo  2 , Z z + z, 

2 -2  TUT  3 -1  14^  2 o 





1 6 9 

F,  = Z),  2 o + Zo  2 o 1 + 2 Z 

-3  TUT  3 -2  7^  2 -1  7720  1 ° 


1 3 

F = — 2,  z , + — ^ z z - 

° 75  ^ 2v^  3 -3  11,^5 


11  9 

z z — z z 

2 ,1.  1 -1 


li+75 


14^5  ° 


The  others  may  be  obtained  from  the  relation 


. Busse  [l]  also  found  special  solutions  for  these  cases  too. 
Bvisse  used  an  extremum  principle  to  determine  the  physically  relevant 
solution.  I win  discuss  that  principle  in  the  final  section.  It 
indicates  that  the  axisymmetric  solutions  are  not  the  physically 
relevant  ones.  Busse  conjectures  that  for  i,=6  the  relevant  solution 
has  the  symmetry  of  a dodecahedron. 
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The  extremtm  principle  I referred  to  above  is  the  following.  For 
t even  the  reduced  bifurcation  equations  can  be  written  in  the  form 


X z 

m 


^ I' 

m — c 


(n 


where 


r 

m --C 


(-1) 


m 


■Hn=0 


] z z 
“l  “l 


Consider  the  function 


P(z) 


1 

3 


-I 


F ^rr. 

m m 


(“D”  F 


m 


z 

-m 


1 

3 


m^-w^-Hn^ 


=o'^“l  “3 


) 


z z z 
“h.  “3 


For  t even  the  Wigner  coefficients  are  cc«i5)letely  symmetric  and  there- 
fore 


F (z z ) = 

m -I  1 5z 

ZD 


Consequently  ovr  reduced  bifurcation  equations  have  a gradient  structure 
and  this  fact  is  independent  of  the  structure  of  the  original  equations 
G(X,u):  It  is  a purely  group  theoretic  result. 


[ 


Therefore  our  bifurcation  equations  (ll)  are  the  Euler -Lagrange 
equations  for  the  minimax  problem 

Min  p(z) 

|z|  =1 

where 


Z z z 
mm 


Z (-1)“  z z 

m -m 


The  function  p is  a third  order  invariant  for  0(3)  • That  is 
p(T  z)  = p(z)  for  aU  g 6 0(3)  . In  terms  of  the  infinitesimal 

s 

generators  this  is  equivalent  to 


p = J^P  = 0 . 


The  norm  |z|  is  the  second  order  invariant. 

Leon  Green  (School  of  Mathematics,  University  of  Minnesota)  and 
I have  succeeded  in  casting  the  bifurcation  problem  in  a slightly 
different  way.  Consider  the  Clebsch-Gordon  series 


® ©. . .©  D° 


and  the  associated  representation 


(12) 


u^A  = D''''^^(g)  A D^'^^(g‘^) 


i 
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on  (t+1)  X (-1+1)  symmetric  matrices  A . This  representation  is 
unitary  relative  to  the  inner  product 

(A,B)  = tr  a . 

Furthermore,  the  third  order  invariant  (there  is  only  one)  is 
P(A)  = i tr  A^  A*  . 

Now  the  highest  weight  space,  the  one  that  transforms  like  in 

(12),  consists  of  symmetric  tensors  (A=A  ) so  we  may  rephrase  our 
bifurcation  problem  as 

Min  i tr  A^ 

subject  to 

2 

tr  A =1  and  tr  A B . 

J 

where  the  B . are  symmetric  matrices 
J 

invariant  subspaces.  In  particular, 
we  get  the  bifurcation  equations 

A^  = X A + 7 I . 

So  far  we  have  only  been  able  to  apply  this  approach  in  the  case  -1  = 2. 
(which  we  have  solved  completely) ; but  it  is  interesting  because  of  its 
similarity  to  L.  Michel's  approach  to  symmetry  breaking  problems  in 
physics.  [2]  . 


= 0 

which  lie  in  the  lower  weight 
tr  A I = tr  A = 0 . For  1=2 
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ORDINARY  DIFFERENTIAL  EQUATIONS  IN  INFINITE 
DIMENSIONS  AND  ACCRETIVE  OPERATORS 
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Madison,  Wisconsin  53706 

ABSTRACT.  In  the  last  ten  years  a nonlinear  theory  of  evolution  problems  which 
has  applications  in  classical  ordinary  differential  equations,  nonlinear  diffusion 
problems,  Stefan  problems,  control  problems  as  well  as  many  other  areas  has  been 
developed.  This  is  the  sense  in  which  this  theory  has  applications  and  the  basic  re- 
sults of  the  theory  are  described. 

INTRODUCTION . There  is  a nonlinear  "theory"  of  evolution  problems  which  provides 
existence,  uniqueness  and  continuous  dependence  results  for  a spectrum  of  problems 
ranging  from  some  in  classical  ordinary  differential  equations  to  others  involving  the 
heat  equation,  the  wave  equation,  nonlinear  diffusion,  a single  conservation  law,  the 
Stefan  problem,  quasi-variational  inequalities  of  evolution  as  well  as  many  more.  An 
expository  introduction  to  this  subject  is  given  in  [5]  while  (1)  provides  a develop- 
ment in  depth.  However,  these  sources  both  need  updating  in  view  of  recent  developments. 


The  current  paper  is  intended  to  supplement  these  sources  by  summarizing  the  basic 
results  while  incorporating  some  recent  advances  from  the  literature.  Section  1 con- 
sists of  a brief  informal  discussion  of  the  relationship  of  the  abstract  results  of 
Section  2 to  applications.  Section  2 summarizes  the  results  of  interest  rather  tersely 
it  should  be  read  as  a supplement  to  [5]  or  [1]. 

1.  Orientation.  In  classical  ordinary  differential  equations  one  studies  initial- 
value  problems  of  the  form 

+ A (u)  =0 

(1) 

I u(0)  = 


where  A maps  a subset  D(A)  of  IR  ” into  IR".  Under  mild  conditions  it  is. proved 
that  (1)  has  a unique  solution  on  some  time  interval  and  that  this  solution  depends 
nicely  on  A and  u^  in  various  senses.  These  results  imply  that  models  met  in  many 

.applications  are  "well-posed".  In  these  models  u represents  the  state  of  the  system 
under  consideration.  For  example,  u might  be  a list  of  numbers  giving  the  positions 
and  velocities  of  a finite  collection  of  particles.  We  are  going  to  discuss  some  exis- 
tence, uniqueness  and  continuous  dependence  results  for  (1)  where  A is  a mapping  in 
some  Banach  space.  The  relationship  of  these  results  to  applications  is  very  much  the 
same  as  in  the  classical  case.  That  is,  many  applied  models  involving  partial  differ- 
ential equations  can  be  abstracted  in  the  form  (1)  where  A is  an  operator  in  am  in- 
finite dimensional  Bangtch  space  (rather  than  in  some  ) and  A will  satisfy  the 

assumptions  made  in  the  next  section.  Thus  the  eibstract  theory  provides  us  with  the 
ability  to  think  about  a wide  spectrum  of  problems  simultaneously  as  well  as  basic  facts 
about  their  solutions.  Hov/ever,  just  as  the  classical  well-posedness  theory  does  not 
give  any  detailed  information  about,  for  example,  the  solutions  of  the  three  body  pro- 
blem, the  theory  discussed  here  does  not  give  detailed  information  about  particular 
solutions  of  particular  problems. 


Under  mild  conditions  it  is. proved 
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We  begin  by  sketching  how  particular  problems  in  partial  differential  equations 
can  be  regarded  as  special  cases  of  the  abstract  Cauchy  problem 


CP(A,f,Ug) : 


+ A(u)  = f(t) 
u(0)  * u- 


In  CP(A,f,u^),  A maps  its  domain 


D(A) , which  is  a subset  of  a Banach  space 


X (or  A:D(A)  C X X),  f:(0,T)  -*•  X is  a strongly  integrable  function,  T > 0 
garded  as  fixed  hereafter  and  u^  e X. 


X,  into 
is  re- 


As  an  example, 
f 


(NDP) 


we  choose  the  nonlinear  diffusion  problem 
^ - (k(u)u^)^  = f(x,t),  X c (0,1),  t > 0 

u (0,t)  = u (l,t)  =0,  t > 0 

X X 

u(x,0)  = Uq(x) , X « (0,1) 


In  (NDP)  we  may  regard  the  unknown  function  u(x,t)  as  the  temperature  of  a one-dimen- 
sional rod.  The  rod  has  insulated  ends,  a temperature-dependent  conductivity  k(u)  and 
is  being  heated  externally.  Let  us  think  of  (NDP)  as  an  equation  describing  how  the 
whole  temperature  field  varies  as  t varies.  That  is,  we  will  regard  (NDP)  as  telling 
us  the  rate  of  change  of  the  function  t -*■  u(*,t)  which  assigns  to  the  time  t the 
"state"  of  the  rod-namely,  the  temperature  field  at  that  time.  We  write  u(t)  for  this 
temperature  field.  The  ways  of  thinking  "t  -►  u(t)"  and  the  classical  "(x,t)  -►  utx,t)“ 

are  related  by  u(t) (x)  = u(x,t).  A good  state  space  for  this  problem  is  X = L^(0,1). 

The  requirement  that  u(t)  e L^(0,1)  simply  corresponds  to  the  heat  energy  in  the  rod 
being  finite  at  the  time  t.  We  rewrite  the  equation  - (k(u)u^)^  “ f(x,c)  in  the 

following  way:  Replace  u by  ^ since  we  now  are  thinking  of  u as  a function  of 

t dt 


t whose  values  are  in  X.  In  the  same  spirit  we  replace  - (k(u)u  ) by  A(u)  where 

XX 

A(v)  = - (k(v)v')'.  In  words,  to  compute  A(v) , where  v is  a function  of  x,  differ- 
entiate V,  multiply  the  result  by  - k(v)  and  differentiate  again.  The  expression 
f(x,t)  will  likewise  be  replaced  by  f(t)  where  f(t)  e X for  each  t.  Now  the  equa- 
tion u^  - (k(u)u  ) = f(y,t)  is  abbreviated  to  du/dt  + A(u)  = f(t). 

t X X 


The  boundary  conditions  (BC)  are  handled  by  incorporation  into  the  domain  of  A. 

Set 


D(A)  = {v  € X : V and  v'  are  absolutely  continuous 
and  v' (0)  = v' (1)  =0} 

That  is,  every  function  in  D(A)  satisfies  the  zero  flux  condition  at  x = 0 and  x = 1. 
With  these  identifications  the  abstract  Cauchy  problem  CP(A,f,Ug)  contains  all  the  in- 
formation (NDP)  contained.  This  rewriting  of  (NDP)  in  the  above  form  attains  significance 
only  when  we  have  information  about  the  solutions  of  CP(A,f,U|j)  which  is  of  interest  for 

(NDP) . Such  information  is  the  topic  of  the  next  section. 


Section  2.  One  of  the  most  basic  and  general  approaches  to  the  solvability  of 
CP(A,f,UQ)  involves  approximation  by  implicit  difference  schemes.  An  implicit  differ- 
ence approximation  to  CP(A,f,Ujj)  on  (0,Tl  is  defined  by  a partition 

{0  “ t_  < t,  <•  • •<  t < T)  of  10,t  ),  a corresponding  sequence  (g.).  < X and  a 

0 1 n n 1 1 = 1 


488 


starting  value  e X.  A solution  of  the  approximation  defined  by  {t^  < t^^  <•••<  t'  }, 

{q,}"  and  x„  is  a piecewise  constant  function  v: (0,t  1 -►  X satisfying 
1=1  0 n 


v(t)  = v(t^)  for  t c 


(1) 


v(0) 


v(t^)  - v(t^_j^) 

t.  - t.  , 

1 1-1 


+ A(v(t^) )=  for  i = l,2,-**,n 


The 

implicit  difference  approximation 

is  E-approximate  to 

CP(A,f,U|j)  on  (0,TI  if 

( 

(i) 

0 < T “ t < e. 

- t^  £ E for  i 

= l,2,»**,n  , 

— n — 

j 

n ti 

(2) 

i 

1 (ii) 

I 4 

- g.  lldt  < E , 

i-l  ^i-1 

(iii) 

11 X - U II  < E 

v 

" 0 0 — 

The 

fundtion 

v:(0,t  1 
n 

-►X  is  an  E-difference  approximate 

solution  of  CP(A,f,u  ) on 

(0,T1  if  it  is  a solution  of  an  e-approximate  implicit  difference  scheme  on  10, Tl . 
We  have  : 

Theorem  1 (Convergence  of  solutions  of  difference  approximations) ; Assume  there  is  an 
ID  € » such  that 


(3) 


|x  - x + A(A(x)  - A(x))  II  ^ (1"X(d)  II  X - x| 
for  x,x  € D(A)  and  X > 0 


Let  Up  € D(A)  , T > 0 and  v^^  be  ^ E^^-dif ference  approximation  to  CP(A,f,Up) 

10, T)  where  lim  e,.  = 0.  Then  there  is  exactly  one  continuous  function  u:[0,Tl  -»  X 
k-~  ^ 


such  that  lim  ||v,  (t)  - u(t) 
k-«>  ^ 


0 uniformly  on  compact  subsets  of  [0,T] . 


Assumption  (3)  is  the  only  restriction  on  A,  replaci 
and/or  compactness  one  is  accustomed  to.  Here  some  simple 
A is  Lipschitz  continuous  with  constant  L,  then  id  = L 
nonincreasing  then  id  = 0 works;  if  X is  a Hilbert  space 
unbounded,  self-adjoint  and  A ^ 0,  or  A is  skew-adjoint 
holds  one  says  that  A ♦ idI  is  accretive  or  -(A  + wl)  is 
tial  operators  and  spaces  in  which  they  have  accretive  real 


in 

L*’,  1 £ 

p ^ (b) 

A(u)  = -A(u°),  a > 0,  in  L^  and 

n 

3 1 • n 

- — (g.(u))  in  L , (e)  A(u)  «-  T 

3Xi  1 

in 

L",(d) 

A(u)  - [ 

i»l 

in 

L^,  1 _< 

p £ « . (f) 

OD 

A(u)  = g(qrad  u)  in  L . Theorem 

nq  the  (Lipschitz)  continuity 
examples  for  orientation:  If 
works:  if  X = F and  A is 
and  A is  linear,  perhaps 
, then  ID  = 0 works.  If  (3) 
dissipative.  Some  differen- 
izations  are:  (a)  A(u)  = -A u 

H ^,(c)  A(u)  = -(Au)**,  a > 0, 


3 

3x . 


3u 

3x.  ' 
1 


q-1 


3u 

3x. 

1 


for  q > 1 


is  proved  in  (61 


• « * 

Assuming  that  the  hypotheses  of  the  above  theorem  are  satisfied,  we  denote  the  limit- 
ing function  u whose  existence  is  asserted  by  K(A,f,Up).  One  simply  defines 

u » K(A,f,Up)  to  be  the  "solution"  of  CP(A,f,Up)  on  10,T1  if  K(A,f,Up)  "exists" 

(i.e.  the  hypotheses  of  the  theorem  hold:  in  particular  (3)  holds  and  Ej^-dif ference 

approximate  solutions  v^  with  Ej^  -»  0 exist)  . 


JO. 
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i 


It  would  seem  natural 


to  discuss 


the  relationship  between  the  notion  "K(A,f,u  1 

0 

of  solution  of  CT(A,f,u^^)  and  more  traditional  ideas  of  a solution^ which  require  the 

existence  of  derivatives,  next.  However,  it  is  more  convenient  to  turn  to  the  question 
of  when  K(A,f,Ug)  exists  first. 


Everywhere  below  we  assume  that  (2)  holds  and  f: [0,T) 


is  strongly  integrable. 


The  simplest  condition  which  guarantees  the  existence  of'  KlA.f.u^)  is: 


[There  exists 


(4) 


> 0 such  that  R(I+XA)  = X for  ,0  < X < X , 
0 < 0 


[where  R{I+XA)  is  the  range  of  I+Xa 


Indeed,  then  every  implicit  difference  approximation  to  CP(A,f,U|^)  on  (0,T] 


with 


max(t.  - t._^)  < X^ 


has  a solution.  For,  given 


and  g^,  we  can  (by  (4))  choose 


v(t^)  .so  that  v(t^)  + (t^ 


t,  ,)A(v(t.))  = v(t.  ,)  + (t.  - t.  ,)g.  and  hence  satisfy 
1-1  1 1-1  1 1-1  1 


X-  = u. 
0 0 


( 1 ) . If  e > 0,  then 
with  t 

I 

and  letting  (t, ,t-,‘*',t  ) be  the  nodes  of 
12  n 


satisfies  (2)(iii)  and  we  can  satisfy  (2) (i)  and  (ii)  ' 

T 

T by  choosing  a step  function  g:[0,T]  -*■  X satisfying  |!  f ( i) -g  (t  ) [I  di  < c 


t.  - t.  < E) 
1 i-l  - 


and 


g.  = g(t^-). 


If  f 


g (with  points  added  as  needed  to  achieve 
we  can  choose  g^  ■ 0 and  t,  = iT/n  for 


t,  = iT/n 
1 


n large  enough  so  that  T/n  ^ E.  if  a is  accretive  (i.e.(3)  holds  with  w = o)  and 
R(l  + ‘,A)  = X for  ' ' 0,  then  A is  called  m-accretive . This  case  frequently  occurs 
in  applications. 


Much  more  subtle  conditions  than  R(I  + >A)  = X guarantee  the  existence  of 
E-differonce  approximate  solutions.  For  example,  if  f ? 0 one  h.as : 

Theorem  2.  Let 

lim  inf  ' ^ distance  (R(I+  A),x)  - 0 for  x '■  D(A) 

X ♦ 0 

Then  K(A,0,u^)  exists  tor  every  u^  e D(A) . 


This  theorem  is  proved  in  IB).  Requiring  that  K(A,f,UQ)  exist  for  every  integrabl 
f:[0,T]  X and  u^  E D(A)  implies  that  R(I+  A)  = X for  > > 0,  and  Xw  < i (if  the 

graph  of  A is  closed).  The  proof  of  tliis  fact  involves  showing  that  K(A,f,UQ)  exists 
whenever  u^  £ D(A)  and 

(f(t)  £ (y:lim  inf  ^ ^ distance  (R( I+^A) ,x+'y)  = 0 
> ^ 0 

for  x £ D(A) ' for  almost  all  t £ (0,T)  , 

generalizing  Theorem  2.  See  (S  1 concerning  a generalization  of  Theorem  2 to  cover  a 
range  of  quite  different  possibilities. 

We  now  discuss  the  relationship  between  K(A,f,UQ)  and  more  classical  notions  of 

solutions  of  CP(A,f,U|^).  A function  u:(0,Tj  X is  a strong  solution  of  CP(A,f,UQ) 

on  (0,T1  if:  (i)  u(0)  = u , (ii)  there  is  an  integrable  function  w:[0,T)  X such 

that  u(t)-u(s)  = w(i)dr  for  t,s  £ (0,T)  (this  condition  on  u will  be  abbreviated 

to  u ' w'''So,T:X)  anil  imiilies  u'  exists  and  u'  = w a.e.),  and  (iii) 
u'(t)  + A(u(t))  = f(t)  a.e.  t ■ (n,T|  . We  have: 


Theorem  3. (a)  Let  CP(A,f,Up)  have  a strong  solution  u o^  10, T] . Then  K(A,f,UQ) 
exists  and  u = KCA.f.u^) . 

(b)  ^ K(A,f,Up)  exist  and  u = K(A.f.u^)  c W^'^CO.TtX) . If  the  graph  of  A is 
closed  and  (5)  holds , then  o ^ ^ strong  solution  of  CP(A,f,Ug) . 

(c)  Let  u = K(A,f,UQ)  exist , e D(A)  and  f:(0,T]  -*■  X be  of  bounded  variation. 

Then  u is  Lipschitz  continuous . 

(d)  In  addition  to  the  conditions  of  (c) , assume  that  X is  reflexive  and  the  graph  of 
A ^ closed.  Then  u i£  a strong  solution  of  CP(A,f,U|j)  on  I0,T)  . 


I • 

The  proof  of  Theorem  3(a)  uses  the  fact  that  integrable  functions  can  be  well  appro- 
mated  by  Riemann  sums.  See  [7,  Sections  4 and  10]  for  a more  oeneral  result.  Similarly, 
[7,  Section  10)  can  be  adapted  in  order  to  prove  (b) . The  result  of  (c)  can  be  deduced 
from  the  estimate 


(6)  e ||u(t)  - v(t)  |l  - e ||u(s)  - v(s)  ||  1 e ||f(T)  - g(T)  H dx 

which  holds  whenever  u = K(A,f,u(0)),  v = K(A,g,v(0))  and  0 ^ s ^ t < T.  Finally, 

(d)  follows  from  the  fact  that  if  X is  reflexive,  then  u:[0,Tl  -*  X is  in  W^'^(0,T;X) 
iff  u is  absolutely  continuous  and  (b)  . 

The  final  type  of  result  we  want  to  consider  is  the  dependence  of  K(A,f,UQ)  on  the 
data  A,f,UQ.  Inequality  (6)  implies  that  K(A,f,UQ)  is  Lipschitz  continuous  in  f and 
Up  uniformly  for  A satisfying  (3).  Var"inq  A also,  one  has; 

Theorem  4 . Let  ^ a sequence  of  operators  satisfying  (3)  with  the  same  u and 

R(I+1A^)  = X for  small  \ > 0.  £f  y • X and  x^  ♦ • y for  n = 0,1,2,  im- 

plies  lim  x » x„,  then  1 im  K(A  ,f,u  _)  = K(A,f,u  ,)  uniformly  on  (0,T1  provided 

n-»"  n 

that  u „ t D(A  ) and  lim  u „ = u„„ . 

nO  n nO  00 

n 

Thus  K(A,f,UQ)  depends  continuously  on  the  initial  data,  the  forcing  term  and  the 

the  equation.  For  example,  within  the  context  of  our  nonlinear  heat  flow  model,  one  could 
interpret  changing  the  conduction  coefficient  k('i)  as  varying  A,  and  then  deduce  that 
the  solution  of  (NDP)  depends  continuously  on  k(u).  A proof  of  Theorem  4 can  bo  found  in 
(2).  Recently,  this  continuous  dependence  result  (in  a somewhat  different  form,  see  [4]) 
was  used  to  explain  in  a simple  way  a numerical  method  for  the  Stefan  problem  (|3)). 
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We  approximate  (1.1)  by  the  familiar  difference  formula 

(1.2)  u(x  + h,y)  + u(x  - h,y)  + u(x,y  + h)  + u(x,y  - h)  - 4u(x,y)  s 0 . 

The  error  involves  fourth  derivatives  of  u.  We  suppose  that  these  exist 
and  are  reasonably  well  behaved.  This  is  usually  the  case,  so  for  a long  time 
it  has  been  customary  to  seelc  an  approximate  solution  for  (1.1)  by  solving  the 
set  of  linear  equations  resulting  from  using  various  values  of  (x,y)  in  (1.2). 
If  h is  small,  so  that  the  error  in  (1.2)  is  small,  the  solution  of  (1.2) 
gives  good  approximations  to  the  values  of  u at  a set  of  grid  points. 

Unfortunately,  if  h is  small,  then  one  has  a very  large  number  of 
linear  equations  to  solve,  and  the  labor  of  computation  is  very  great.  Until 
the  advent  of  the  computer,  one  compromised  by  using  a fairly  large  value  of 
h,  to  curtail  the  calculation,  but  one  had  to  be  content  with  not  a very  good 
approximation. 

One  effort  to  use  computers  to  improve  this  situation  is  embodied  in  item 
[11  of  the  Bibliography,  by  Kantorovich,  Krylov,  and  Chernin.  If  one  has 
values  for  u prescribed  on  the  boundary  of  a rectangle,  the  tables  in  [1] 
allow  one  to  get  fairly  quiclcly  the  solution  of  (1.2)  inside  the  rectangle. 

(This  can  now  be  done  more  quickly  by  means  of  the  Fast  Fourier  Transform,  so 
that  [1]  is  now  obsolete.) 

To  show  the  effectiveness  of  their  tables,  the  authors  of  [1]  undertook  to 
find  the  solution  of  (1.2)  inside  an  L-shaped  region  (see  Fig.  1).  They 
prescribed  values  for  u around  the  boundary,  and  used  the  Schwarz  alternating 
procedure.  Specifically,  they  first  guessed  values  along  CF.  Using  these 
with  the  boundary  conditions,  the  tables  gave  the  solution  of  (1.2)  inside  the 
rectangle  ABFG.  In  particular,  they  gave  values  along  HC.  Using  these  with 
the  boundary  conditions,  the  tables  gave  the  solution  of  (1.2)  inside  the 
rectangle  HDEG.  This  led  to  a better  guess  for  the  values  along  CF.  Using 
the  better  guess,  the  process  was  repeated.  After  a modest  number  of  repeti- 
tions, the  procedure  converged  to  give  about  six  decimal  accuracy. 


1 


Figure  1 


This  was  the  solution  of  (1.2)  inside  the  L-shaped  region.  As  noted,  tlie 
error  of  (1.2)  as  an  approximation  to  (1.1)  depends  on  fourth  derivatives  of  u. 

In  [2]  and  [3],  which  appeared  six  to  four  years  before  [1],  Wasow  and  Lehman 
had  shown  that  in  the  neighborhood  of  a reentrant  corner  (such  as  C in  Fig.  1) 
one  should  expect  to  have  an  unbounded  first  derivative.  With  an  unbounded 
first  derivative,  one  cannot  expect  good  behavior  from  fourth  derivatives.  So, 
in  the  neig)iborhood  of  C in  Fig.  1,  one  should'  expect  the  solution  of  (1.2) 
(which  was  obtained  in  [1])  to  be  a fx^or  approximation  to  the  solution  of  (1.1)  . 

2.  Two  lemmas.  To  get  some  comprehension  of  the  results  of  VJasow  and 
Lehman  in  [2]  and  f3]  , we  use  two  leiraaas,  which  we  state  here  without  pr>.iof. 

Lemma  2.1.  Let  a < b.  Let  f (a''  = f (b)  = 0.  Let  f (x)  have  aliiiost  every- 
where a second  derivative  for  a ^ ^ 1.  which  is  of  bounded  variation.  Then 
tlie  Fourier  series  for  f (x)  in  the  interval  a ^ x ^ b, 

CO 

/->  1 » , V • ''^m(x  - a) 

(2.1)  f(x)  = ) D sin  — r 

m b - a 

in=l 

wi  th 

_ b , , 

• 2)  D = ; ] f (x)  sin  — r dx  r 

m )i  - a ■'  b - a 

a 

converges  very  rapidly,  and  a largo  number  of  t)ic  D can  be  calculated  very 
quic)cly  by  means  of  the  Fast  Fourier  Transform. 
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By  "converges  very  rapidly"  is  meant  that  goes  to  zero  at  least  of 

the  order  of  m Thus  one  can  truncate  the  series  on  the  right  of  (2.1)  after 
500  terms  and  reasonably  expect  to  get  from  six  to  eight  significant  decimal 
places  correct.  And  the  Fast  Fourier  Transform  will  enable  one  to  calculate  the 
needed  500  coefficients  very  quickly.  The  reasoning  to  establish  this  lemma  is 
given  in  pp.  6-8  of  [4] . 

Lemma  2.2.  Let  u(x,y)  be  harmonic  in  a region,  part  of  the  boundary  of 
which  is  a straight  line  segment.  Let  u(x,y)  = f (s)  on  this  straight  line 
segment,  where  s is  length  along  the  segment.  Let  f (s)  and  its  first  n 
derivatives  be  continuous,  and  let  the  (n  + l)-st  derivative  he  bounded  and 
continuous  except  at  a set  of  points  of  measure  zero.  Then  each  partial  deriva- 
tive of  u(x,y)  of  order  ^ n has  a continuous  extension  to  the  straight  line 
boundary. 

Thm.  2.3  on  p.  27  of  [5]  states  this  for  a special  case.  The  truth  of  the 
lemma  in  general  follows  easily  from  the  special  case. 

In  the  present  report,  we  shall  confine  our  attention  to  the  case  where  the 
proscribed  values  of  u around  the  boundary  are  quite  smooth;  say  tliat  the  third 
derivative  is  )x)unded  and  continuous  except  at  a set  of  measure  zero.  It  is 
planned  to  v;rite  a sequel  to  [5]  explaining  how  to  handle  a variety  of  irregular- 
ities along  the  boundary.  Certain  sorts  of  irregularities  that  could  occur  along 
the  boundary  can  be  "removed"  by  the  methods  given  on  pp.  221-222  of  16).  So  it 
does  not  seem  unduly  restrictive  to  confine  our  attention  to  harmonic  functions 
u(x,y)  wliich  are  quite  smooth  around  tlie  boundary.  In  the  present  report  we  do 

Let  u(x,y)  be  such  a harmonic  function  in  the  L-shaped  region  of  Fig.  2. 


(2,2) 


(2,0) 


Figure  2 


Indeed  we  will  shortly  specialize  to  prescribing  that  on  the  boundary  in  Fig.  2 
we  will  have 

(2.3)  u(x,y)  = j Jtn{{x  + 1)^  + (y  - 1)^}  . 

In  Fig.  2 we  have  shown  three-quarters  of  a circle  of  radius  A and  center 
at  the  origin.  We  undertake  to  determine  the  behavior  of  u(x,y)  inside  the 
three-quarters  circle. 

(s) 

Choose  u (x,y)  a function  which  in  the  interior  of  the  figure  is  harmonic 
in  the  neighborhood  of  the  sides  BC  and  CD,  including  the  three-quarters 
circle  (one  can  take  A quite  small  if  need  be) , and  which  takes  the  seune  values 
along  BC  and  CD  that  are  prescribed  for  u(x,y).  Instructions  for  finding 
(s) 

such  a function  u (x,y)  are  set  forth  in  [5] . If  we  have  prescribed  the 
particular  boundary  conditions  (2.3)  for  u(x,y),  such  a function  is 

(2.4)  fn(z  + 1 - i)  + fn(z  + 1 + i) 

+ ln(z  - 1 - i)  - £n(z  - 1 + i) } , 

where  we  have  taken 

(2.5)  z = X + iy  . 

It  can  easily  be  verified  that  the  right  side  of  (2.4)  satisfies  (2.3)  along 
the  entire  x-axis  and  the  entire  y-axis.  Also  (2.4)  is  harmonic  except  at  the 
four  points  z = ± 1 ± i.  To  keep  the  three-quarters  circle  inside  the  region 
where  (2.4)  is  harmonic,  it  suffices  to  take  A < /2.  We  choose  such  a value 
for  A,  and  proceed. 

(s) 

Since  u(x,y)  and  u (x,y)  take  the  same  values  along  BC  and  CD,  we 
conclude  that  along  these  two  seginents 

(2.6)  u(x,y)  - u^®^ (x,y) 
is  zero. 

Measure  the  angle  0 as  usual,  counterclockwise  around  the  origin  from  CD. 

On  the  three-quarters  circle,  we  will  have  (2.6)  a function  of  0 only,  since 
we  have  fixed  A.  Call  this  f(0).  As  u(x,y)  and  u'®^(x,y)  take  values 
along  BC  and  CD  tliat  are  differentiable  an  infinite  number  of  times,  it 
follows  from  Lemma  2.2  that  f (0)  is  infinitely  dif f erentialjle  as  0 approaches  j 

0+  and  (3Tr/2)-.  For  0 < 0 < 3ti/2,  f(0)  is  the  difference  of  two  harmonic 
functions,  and  hence  is  infinitely  differentiable.  So  for  0 £ 0 3ii/2,  f(0) 

has  derivatives  of  all  orders. 

As  (2.6)  is  0 on  BC  and  CD,  we  have  f(0)  = f(3'n/2)  = 0.  So  we  take 
a = 0 and  b = 3ti/2  in  Lemma  2.1,  and  conclude  that  f(0)  has  a rapidly  converg- 
ing Fourier  series  expansion  for  0^05.  3it/2;  put  0 for  x in  (2.1)  . 

Because  of  the  rapid  convergence,  we  see  that 
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(2.7) 


is  a harmonic  function  for  0 < r < A and  0 < 6 < 3Tr/2.  It  equals  f (6)  for 
r = A,  and  is  0 for  0 = 0 or  0 = 3tt/2.  But  (2.6)  satisfies  these  same 
conditions.  As  a harmonic  function  is  uniquely  determined  in  a region  by  its 
values  around  the  boundary,  we  must  have  (2.6)  equal  to  (2.7)  inside  and  on  the 
three-quarters  circle.  Solving  for  u(x,y),  we  must  have  u(x,y)  equal  to  the 
sum  of  (2.4)  and  (2.7)  inside  and  on  the  three-quarters  circle. 

For  the  particular  boundary  conditions  which  we  have  chosen  (see  (2.3))  we 
have  ^ 0 in  (2.7).  So  if  we  fix  a value  of  6,  0 < 0 < 3it/2,  and  approach 
tlie  origin  along  that  ray,  (2.7)  will  have  an  infinite  first  derivative.  As  (2.4) 
is  harmonic  inside  the  entire  circle  of  radius  A and  center  at  the  origin  (we 
took  A < \/2) , it  has  well  behaved  derivatives  of  all  orders  at  C.  So  u(x,y) 
must  have  an  infinite  derivative  as  r approaches  ;!ero. 

Of  course,  if  it  had  turned  out  that  = D = D.  = = 0,  then  u(x,y) 

would  have  had  well  behaved  fourth  derivatives. ■ But  it  did  not  turn  out  that 
way.  In  (2]  and  [3],  Wasow  and  Lehman  made  a study  of  the  asymptotic  behavior 
of  harmonic  functions  near  reentrant  corners.  Their  studies  were  quite  general, 
covering  curved  boundaries  and  a wide  variety  of  conditions.  The  series  tlioy  got 
were  only  asymptotic,  but  scries  like  (2.7)  were  typical  (except  that  (2.7) 
converges)  . Indeed,  we  are  lucky  witl'i  our  particular  problem,  in  that  our 

( s)  ( s ) 

u (x,y)  is  harmonic  in  the  neighborhood  of  the  corner.  More  generally,  u (x,y) 

contributes  additional  complications,  such  as  terms  involving  logarithms. 

In  view  of  this,  one  wonders  why  the  authors  of  [1)  managed  to  get  such  good 
results  near  the  reentrant  corner.  This  came  about  as  follows.  In  order  to  be 
able  to  check  if  their  procedure  was  giving  tlie  right  answers,  they  took  a problem 
in  which  the  answers  were  known.  They  ciiose  u(x,y)  a function  tliat  was  well 
behaved  over  a much  larger  region  than  that  shown  in  Fig.  1.  From  it,  they  read 
values  around  the  boundary,  and  proceeded  to  solve,  getting  back  u(x,y)  of 
course.  Since  tliey  started  witli  a function  that  was  well  behaved  over  a large 
region,  including  the  reentrant  corner,  they  insured  that  = 0 

in  (2.7).  So  of  course  they  had  well  behaved  fourth  derivatives,  and  (1.2)  was 
an  excellent  approximation  to  (1.1),  and  their  answers  agreed  closely  with  the 
true  values.  Very  comforting  for  them,  but  very  misleading  for  the  reader.  Had 
they  used  the  boundary  conditions  (2.3),  their  answers  would  have  been  very  poor 
near  C.  On  the  other  hand,  they  probably  did  not  have  a way  to  get  the  correct 
answers  for  the  boundary  conditions  (2.3),  and  so  would  not  have  known  if  they 
had  good  answers  or  not. 

3.  The  soltition  inside  a rectangle.  This  brings  us  to  tlie  crucial  ques- 
tion. How  docs  one  got  correct  answers  with  boundary  conditions  like  (2.3)? 

First  we  have  to  have  a technique  for  carrying  out  a solution  inside  a rectangle, 
which  we  now  explain.  Given  a rectangle  with  smooth  boundary  conditions  prescribed 
around  its  perimeter,  how  does  one  determine  a u(x,y)  wliich  is  harmonic  in  the 
interior  and  takes  the  prescribed  values  on  the  perimeter? 
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Figure  3 


Consider 

•k 

(3.1)  u (x,y)  = u(x,y)  - A - Bx  - Cy  - Dxy  . 

This  is  harmonic,  and  has  smootli  boundary  conditions.  It  is  easy  to  choose  A, 

B,  C,  and  D so  that  u (x,y)  takes  the  value  zero  at  each  corner  of  the 

rectangle.  Along  tlio  top  let  u (x,y)  = f (x)  , for  a < x < b.  Cur  choice  of 

A,  B,  C,  and  D assures  tliat  f (a)  = f (b)  = 0..  Also,  as  we  were  assuming  smootli 

boundary  conditions,  lot  us  say  that  that  assures  that  f (x)  )ias  almost  every- 
where a second  derivative  for  a ^ x ^ b which  is  of  bounded  variation.  So 
f(x)  satisfies  the  conditions  of  Lemma  2.1.  We  get  its  Fourier  expansion,  (2.1), 
with  the  D defined  by  (2.2).  Consider 


. , irm(y  - c) 

“ sinh  — , . 

, , V r,  b - a . iim(x  - a) 

u.  (x,y)  = 1 D r sin  — 

t m . , Timh  b - a 

m=l  sinh  ; 

b - a 


whore  h is  t)ie  Ijeiglit  of  the  rectangle  and  c is  the  value  of  y at  the 
bottom  of  the  rectangle.  Clearly  u^(x,y)  is  harmonic.  It  is  zero  along  the 


left  side  of  the  rectangle  (x  = a) , it  is  zero  along  the  right  side  of  the 
rectangle  (x  = b) , it  is  Zero  along  the  bottom  of  the  rectangle  (y  = c) , and 


it  equals  f(x)  along  the  top  of  the  rectangle;  that  is,  on  tlie  top  it  agrees 
with  u (x,y)  . Vie  carry  out  an  analogous  construction  for  eacli  of  the  other 


tliree  sides  of  the  rectangle,  and  add  together  the  resulting  four  series.  Since 
the  sum  agrees  witli  u (x,y)  on  the  entire  perimeter,  it  )ias  to  bo  equal  to 
u (x,y)  throughout  the  rectangle.  Tlien  w’e  determine  u(x,y)  from  (3.1). 


Armed  with  tliis  tcclinique,  let  us  return  to  the  problem  of  Fig.  1.  Values  j 

u(x,y)  have  been  prescrilied  around  t)io  boundary  (for  example,  see  (2.3)).  i] 

guess  values  along  CF.  Wit)i  the  lioundary  conditions,  t)iis  gives  values  j 


around  the  perimeter  of  the  rectangle  ABFG.  As  described  just  above,  we  get  a 
harmonic  function  inside  this  rectangle  which  takes  the  prescribed  boundary  condi- 
tions. It  gives  us  values  aloiig  HC.  With  these  and  the  boundary  conditions, 
we  have  values  around  the  perimeter  of  the  rectangle  HDEG.  From  these,  we  get 
values  in  tlie  interior,  including  along  the  line  CF.  This  will  bo  an  improvement 
over  our  first  guess. 

We  repeat  the  process.  In  an  actual  calculation,  with  the  conditions  (2.3), 
it  took  about  fifteen  iterations  for  convergence.  However,  because  the  Fast 
Fourier  Transform  gets  the  D very  quickly,  the  calculation  to  convergence  did 
not  take  very  long.  However,  it  did  not  converge  to  tlie  u(>:,y)  we  were  seeking. 
Recall  that  in  Lemma  2.1,  it  was  required  that  f (x)  have  a second  derivative  of 
bounded  variation.  But  the  u(x,y)  defined  by  conditions  (2.3)  has  an  infinite 
first  derivative  as  one  approaches  C along  CF. 

This  seems  too  bad.  However,  the  procedure  we  just  described  is  not  without 
value.  In  fact,  it  will  be  the  one  we  will  use  in  the  end,  but  with  a slight 
modification.  Our  difficulty  (refer  to  Fig.  1)  is  that,  along  the  lines  BF  and 
HD,  the  function  u(x,y)  that  we  are  trying  to  determine  does  not  have  almost 
everywhere  a second  derivative  of  bounded  variation.  If  we  should  try  this 
procedure  on  a u(x,y)  which  does  have  almost  everywhere  a second  derivative  of 
bounded  variation  along  the  lines  BF  and  HD,  we  would  succeed  admirably  in 
determining  that  u(x,y),  and  in  terms  of  rapidly  converging  Fourier  series.  All 
we  need  for  u{x,y)  is  to  know  its  values  around  the  boundary,  and  to  be  assured 
that  it  is  sufficiently  smooth  along  the  lines  BF  and  HD. 


4.  A.  slight  modif  i cat-ion.  Recall  tliat  the  u(x,y)  we  are  seeking  to 
determine  equals  (2.4)  plus  (2.7)  inside  and  on  the  three-quarters  circle,  and 
that  (2.4)  is  very  smooth  along  the  lines  BF  and  HD.  Because  of  this,  we 
will  s)iow  that 


(4.1) 


u(x,y) 


4 


sin 


3' 


has  a second  derivative  of  bounded  variation  along  both  the  lines  BF  and  HD. 
Along  BC  and  CD,  (4.1)  equals  the  riglit  side  of  (2.3),  which  is  very  smooth. 
Inside  and  on  the  three-quarters  circle,  (4.1)  equals  (2.4)  plus  the  remainder 
of  the  scries  (2.7),  which  is  smooth  enough.  And  from  the  three-quarters  cii'cle 
out  to  F or  H,  (4.1)  is  the  sum  of  three  harmonic  functions  out  to  a straight 
line  border  along  which  their  boundary  values  are  infinitely  differentiable;  by 
Lemma  2.2,  all  derivatives  exist  continuously  out  to  the  border. 


If  we  could  somchov/  determine  the  values  of  D^  and  D2/  we  could  determine 

(4.1)  by  t))e  procedure  of  the  previous  section.  We  certainly  can  determine  tlie 
values  of  (4.1)  around  the  boundary:  we  liad  had  to  choose  a value  of  A before 


the  values  of  D^^  and  D^  could  be  defined  (in  fact,  we  had  ciiosen  A = 1 for 


our  calculation),  and  the  values  of  u(x,y)  are  given  by  (2.3).  Also,  as  we 


Lhave  just  carefully  ascertained,  (4.1)  has  a second  derivative  of  bounded 

variation  along  the  lines  BF  and  HD.  Being  given  the  values  of  D^  and  D^ » 
we  could  then  calculate  u(x,y)  from  (4.1). 


So  we  arc  faced  with  the  problem  of  determining  and  02* 


We  remind  the  reader  that  a computer  operates  lineai'ly.  To  calculate  (4.1) 
by  the  procedure  of  the  previous  section,  we  would  get  tiie  same  numerical  answers 
by  either  of  the  two  following  procedures. 

(1)  Apply  the  procedure  to  the  total  function  (4.1). 


(2)  Apply  the  procedure  first  to  u(x,y),  getting  some  Fourier  expansions 


S , then  apply  the  procedure  to 


(4.2) 


2 
3 

— I sin 


26 


getting  some  Fourier  expansions  then  apply  the  procedure  to 


(4.3) 


/ r 40 

(a)  T ' 


getting  some  Fourier  expansions  S , and  finally  combine  the  various  Fourier 
expansions  into 


(4.4) 


D2S 


III 


Although  will  he  a p>oor  representation’  of  u(x,y),  as  we  observed  in 

the  previous  section,  and  and  wil."  be  poor  representations  of  (4.2) 

and  (4.3),  for  similar  reasons,  the  combination  (4.4)  v;ill  be  an  excellent  repre- 
•sentation  of  (4.1),  since  tlie  linearity  of  the  computer  as.sures  that  it  comprises 
the  same  numl:)ors  that  one  would  get  by  applying  the  procedure  of  tlie  previous 
section  to  the  entirety  of  (4.1). 

With  no  more  ado,  let  us  proceed  to  determine  S^,  and  Consid- 
ering and  D2  as  two  (as  yet)  unknown  parameters,  we  can  take  (4.4)  as 

representxng  (4.1).  Subtracting  (2.4)  from  (4.4),  wo  will  have  a representation 
of 


(4.5) 


u(x,y) 


(s) 


(x,y) 


That  is,  using  (4.4)  minus  (2.4),  we  can  actually  calculate  values  of  (4.5)  at 
any  points  of  the  L-shaped  region  of  Fig.  2,  except  tliat  the  values  will  come 
out  as  linear  combinations  of  and  D2. 

Observe  tliat  (4.5)  is  zero  on  both  the  linos  BC  and  CD,  since  (2.6)  was. 
So,  by  the  same  metliod  that  we  used  to  get  the  expansion  (2.7)  for  (2.6)  inside 
and  on  the  tliree-quartors  circle,  we  can  get  an  expansion  like  (2.7)  for  (4.5). 
Obviously,  this  expansion  has  to  consist  of  (2.7)  with  t)ie  first  two  terms 
deleted.  So,  when  we  take  m = 1 and  2 in  (2.2)  to  get  and  D2,  we  must 


get  the  value  zero.  But,  as  the  values  of  f (x)  in  (2.2)  are  taken  from  (4.4) 
minus  (2.4),  tlie  numerical  quadratures  to  dote  mine  (2.2)  must  yield  linear 
combinations  of  and  D^.  Putting  these  linear  combinations  equal  to  zero 

for  m = 1 and  m = 2 gives  us  two  simul  ..neous  linear  equations  for  and 

D^.  We  solve  these.  Putting  the  solutions  into  (4.4)  gives  Fourier  expansions 

for  (4.1).  But  now  wc  know  and  D^,  and  so  can  calculate  u(x,y)  from  (4.1). 


5.  Acknowledgements.  In  a Part  II,  we  will  report  numerical  results  for 
the  problem  considered  above,  and  results  for  other  problems  that  can  be  handled 
by  similar  techniques.  I wish  to  express  my  gratitude  to  Gershon  Kedem  for  his 
assistance  with  tliese  activities.  He  carried  out  the  needed  programming,  and 
supervised  the  actual  calculations.  He  also  suggested  simplifications,  and 
helped  me  get  my  thougtits  in  order  and  keep  track  of  the  details. 
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ADAPTIVE  ACCELERATION  OF  SSOR  FOR  SOLVING  LARGE  LINEAR  SYSTEMS 


Vitalius  BenoVraitis 
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U,  S.  Army  Ballistic  Research  Laboratory 


ABSTRACT . Symmetric  successive  overrelaxation  (SSOR)  for  solving  large, 
sparse  systems  of  linear  equations  involves  the  estimation  of  a para- 
meter (D.  An  adaptive  procedure  is  outlined  for  improving  the  estimates 
for  b)  and  the  spectral  radius  S(S  ) of  the  iteration  matrix  S . These 

estimates  are  then  used  in  the  SSOR  method  with  Chebyshev  acceleration.  The 
objective  is  to  achieve  convergence  in  only  a few  more  iterations  than  would 
be  required  if  the  best  possible  values  of  u and  S(S^)  were  used  from  the 

outset.  The  method  is  applied  to  obtain  finite  difference  solutions  of 
a number  of  generalized  Dirichlet  problems.  In  certain  cases,  the  number 

of  iterations  required  using  the  adaptive  procedure  increases  like  h , 
where  h is  the  mesh  size. 

1.  INTRODUCTION.  We  shall  be  concerned  with  iteratively  determining  the 
N-vector  u of  a large,  sparse  linear  system 

(1)  Au  = b 

where  A is  a real,  symmetric,  positive  definite  matrix  of  order  N and  b is 
a given  N-vector.  Such  systems  arise  in  the  finite  difference  solution  of 
elliptic  boundary  value  problems.  Particularly,  we  shall  develop  an  adaptive 
scheme  based  on  the  symmetric  SOR  (SSOR)  iterative  method  with  Chebyshev 
acceleration.  Related  work  which  has  recently  appeared  includes  Axelsson 
(1972),  Hayes  and  Young  (1977)  and  Young  (1974a,  1974b,  1974c,  1977). 

2.  BASIC  METHOD.  By  defining 


B = I-D‘U  = L 1-  U 


c = D’^b 

where  D = diag  (A)  and  L and  U are  strictly  lower  and  upper  triangular 
matrices,  respectively,  we  may  replace  the  system  (1)  by 


The  SSOR  method  (Sheldon  (1955))  is  then  defined  by  forming  a single  SSOR 
iteration  from  a forward  SOR  iteration  followed  by  a backward  SOR 
iteration;  that  is,  for  n = 0,  1,  2,  ...  we  set 

y(n+*i)  » + c)  + (l”(i))u^^^ 


where  is  an  arbitrary  initial  approximation  to  the  solution  u,  and  u 

is  a real  relaxation  parameter  such  that  0<u)<2. 

Elimination  of  u^”***^  in  (2)  gives 

u(n+l)  . 5 ^(n)  ^ k 

u u 


where 


S 

u 


U L 


(1)  (li 


* j(I-a)U)'^  (aiL  + (1-(d)I)|  | (I-a)L)'^(u)U+(l+u)I) } 

= oi(2-u,)  (I-u>U)-l  (I-o>L)'^c 

Note  that  corresponds  to  the  familiar  SOR  iteration  matrix.  The 
backward  SOR  operator  is  defined  analogously. 

If  storage  for  an  extra  N-vector  is  provided,  the  work  required  for 
one  SSOR  iteration  may  be  reduced  to  about  the  work  necessary  for  a single 
SOR  iteration.  The  work-saving  technique  is  due  to  Niethammer  (1964)  and 
is  described  in  Benokraitis  (1974,  1976)  and  Young  (1977).  The  method  has 
been  rediscovered  by  Conrad  and  Wallach  (1977). 

The  SSOR  method  converges  if  S(S^),  the  spectral  radius  of  the  iteration 

matrix  S^,  is  less  than  1,  which  holds  if  0<ui<2  and  A is  positive  definite. 

The  rate  of  convergence  is  governed  by  the  ordering  of  the  equations  and  by 
the  parameter  u.  Assuming  the  natural  ordering.  Young  (1974a,  1974b,  1977) 
has  shown  that  for  a certain  discrete  generalized  Dirichlet  problem  one  can 
choose  a "good"  m depending  on  bounds  for  the  eigenvalues  of  B and  LU  so 
that  the  SSOR  method  converges  with  the  same  order-of-magnitude  as  the  SOR 
method.  For  a finite  difference  discretization  with  mesh  size  h,  the 
number  of  iterations  required  for  both  methods  increases  like  h"  . 
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Therefore,  even  by  employing  Niethammer's  work-saving  scheme,  there 
is  little  justification  for  using  SSOR.  However,  the  SSOR  method  can  be 
accelerated  by  an  order-of-magnitude  by  means  of  Chebyshev  semi -iteration 
since  the  eigenvalues  of  the  matrix  are  real  and  nonnegative.  (Chebyshev 

semi-iteration  was  first  studied  by  Varga  (1957)  and  Golub  and  Varga  (1961).) 
This  approach  is  precluded  for  SOR  with  optimum  u = since  many  of  the 

eigenvalues  of  L are  complex.  (See  Varga  (1957)  and  Young  (1971).)  Also, 

there  is  no  improvement  when  semi-iteration  is  applied  to  SOR  with  l<a)<Wj^. 

(See  Kincaid  (1974).)  For  accelerating  the  Gauss-Seidel  method  (SOR  with 
(ij  = 1) , see  Sheldon  (1959)  2uid  Young  (1971) . 

3.  ACCELERATED  METHOD.  The  optimum  semi-iterative  method  based  on  SSOR, 
denoted  by  SSOR-SI,  is  defined  by 

(3)  . k„)  . (1-5)  (l-»„,j) 


Here 


- 2-S(SJ 

Pi  = 1 

Pj  - (1  - P^/2)'^ 

2 

Vl  ■ (1  ■ 2.  3. 


where 


a 


S(S  ) 

01 

2-S(S  ) 
0)-' 


In  order  to  apply  the  SSOR-SI  method,  we  must  determine  the  two  parameters  oi 
and  S(S^).  Some  a priori  methods  for  obtaining  these  parameters  are  discussed 

by  Habetler  and  Wachspress  (1961),  Evans  and  Forrington  (1963),  Young  (1974a, 
1974b,  1977),  and  Benokraitis  (1974,  1976).  Since  finding  these  parameters 
may  take  as  much  work  as  solving  the  original  problem,  we  are  led  to  con- 
sider adaptive  techniques  which  approximate  the  parameters  and  at  the  same 
time  improve  the  solution  of  the  linear  system. 


505 


I 


4.  FOUNDATION  FOR  ADAPTIVE  METHOD.  We  begin  by  characterizing  the  eigen- 
values  of  S in  terms  of  certain  inner  products.  This  result  is  due  to 


Habetler  and  Wachspress  (1961).  See  also  Young  (1977). 


THEOREM  1 . Let  \ be  an  eigenvalue  of  S where  0<u)<2  and  let  v be  an  associ- 


ated eigenvector.  Then  X may  be  represented  by 


X = 1 - a)(2-a))  j-  = (P(u,v) 

l-ua*u  8 


where 


(v.DBv) 

(v.Dv) 


(v.DLUv) 

(v.Dv) 


THEOREM  2 . The  representation  ^(u.v)  given  by  (4)  for  any  vector  v 0 is 
a Rayleigh  quotient  with  respect  to  the  vector 


w = (I-u)U)d’^; 


and  the  symmetric  matrix 


5 , = (I-iuU)d’^S  D'*^(I-(oU)'^ 


where 


u = d'^ud’*^  . 


That  is. 


((>  (u.v) 


(w,5,  w) 


(w,w)  • 


Furthermore,  5 is  similar  to  S and 

(I)  0) 


♦ (a),v)  < S(S  ) - S(S  ). 

— W (jJ 


Proof:  See  Benokraitis  (1974). 

We  emphasize  that  (6)  holds  for  any  nonzero  vector  v,  not  just  for  eigen- 
vectors of  S . However,  the  closer  we  approach  a fundamental  eigenvector, 
the  closer  “ we  shall  be  able  to  determine  S(S  ) from  ♦(u,v)  given  by  (4). 
Therefore,  it  would  be  fortunate  if  somehow  we  could  determine  the  fundamental 
eigenvector  without  deviating  from  the  path  of  improving  the  approximate 
solution  of  (1).  A clue  leading  to  the  desired  situation  is  contained  in  the 
following  theorem  (cf.  Young  (1974c)). 

THEOREM  3.  The  pseudo-residual  vector 

(7)  6^"^  » S u^"^  + k - 

where  u^”^  is  the  latest  SSOR-SI  iterate,  satisfies 

(8)  . 


Here 


where  T^(x)  is  the  n^^  degree  Chebyshev  polynomial  defined  by  the  three-term 
recurrence  relation 


T^(x)  - 1,  Tj(x)  = X 

’'n.lW  = 2xT^W  - Vl(x).  n>  1 . 

Proof:  See  Benokraitis  (1974). 

We  note  that  if  is  replaced  by  then  (8)  reminds  us  of  the  power 

method  for  computing  the  dominant  eigenvector.  With  this  motivation,  the  next 
theorem  comes  as  no  surprise. 

THEOREM  4.  The  pseudo-residual  vector  6^”^  given  by  (7)  converges  in  direction 
to  the  eigenvector  associated  with  the  eigenvalue  S(S^)  as  n tends  to  infinity. 


r~ 


I Proof:  See  Benokraitis  (1974).  Also  compare  with  Diamond  (1971)  and  Hageman 

(1972). 

By  Theorem  4 it  is  possible,  then,  to  determine  the  fundamental  eigen- 
vector with  a little  additional  effort  by  computing  the  pseudo-residual  vector. 
However,  since 

• 6^"^  = S uf"5  + k - u^"^  = uf"^  - 

^ U)  u 

one  SSOR  latest  SSOR-SI 

iteration  iteration 

and  since 

u^”'^  = S u^"^  + k 

(i)  0) 

I must  be  computed  as  part  of  the  next  SSOR-SI  iteration  (see  (3)),  the 

I pseudo-residual  vector  is  essentially  obtained  as  a byproduct  of  applying  the 

SSOR-SI  method. 

5.  ADAPTIVE  METHOD.  By  using  Theorems  1,  2,  and  4 as  a foundation  we  are  able 
to  present  the  basic  structure  for  an  adaptive  procedure.  For  detailed  descrip- 
tions of  this  method  and  several  possible  variations  for  the  adaptive  accelera- 
tion of  SSOR,  see  Benokraitis  (1974) . 

We  state  the  steps  of  the  "algorithm"  in  outline  form  with  a synopsis  of 
the  controlling  theorem(s)  in  the  heading.  We  use  the  word  "algorithm"  loosely, 
since  admittedly  much  is  left  unspecified. 

I.  Theorem  2.  For  any  v ^ 0,  (ji(a),v)  ^ S(S^). 

1.  Choose,  Vj,  0. 

2.  Observe 

a.  - iKai.Vj)  ^ S(SJ 


b.  ^2  ■ 

c,  ♦j(m)  - max  (ijij,  ±S(SJ 

Vi,V2 

3.  Minimize  •^(u)  with  respect  to  u to  obtain  estimate  u^. 
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4.  Choose  *,(0),)  as  S^CS  an  estimate  of  SfS  ).  (The  situation 

11  E ojj''’  “l 

is  depicted  in  Figure  1.) 

II.  Theorem  4.  The  pseudo-residual  vector  converges  in  direction  to 

dominant  eigenvector  v. 

1.  Set  i = 1 

2.  Iterate  n times  with  SSOR-SI  with  parameters  )»  test  for 

for  convergence  ^ 

3.  Compute 


6^"^  = (S  n^"^  + k ) - u^"^ 
^0)  u 


which  approaches  dominant  eigenvector. 


4.  Check  if  parameters  should  be  changed. 

a.  If  < Sg  (S  ) do  not  change  parameters.  Go  to  II. 2 

b.  If  4i (tu^ , 6 ) > Sg  continue  to  step  III  to  change  parameters 

III.  Theorems  1,  2,  4.  As  6^"^  approaches  dominant  eigenvector,  6^^^) 

approaches  S(S  I. 

“i 

1.  Set  vj.2  . jf") 


2.  Observe  (u)  = max  ((ij.  •••.  ♦142^ 

Vj^,k*l,  . . . , i+2 

< SCS  ) 
w 

3.  Minimize  with  respect  to  u to  obtain  next  estimate 

4.  Choose  ♦.  ,((D.^,)  as  S_  (S  ),  an  estimate  of  S(S  ).  Set  i » i+1 

1^1  ^^1  ^ “i+1  “i*l 

Go  to  II. 2.  Process  is  continued  until  convergence. 


509 


method  (Young  (1971)).  A strategy  which  produces  acceptable  results  is  to 


choose  n so  that  the  average  rate  of  convergence  after  n iterations  is  90%  of 
the  asymptotic  average  rate  of  convergence.  The  convergence  rates  are  computed 
using  the  latest  estimate  of  S(S^).  That  is,  n is  chosen  to  be  the  least  n 

which  satisfies 


■ n ~ ^ r) 

1+r 


where 


A word  about  the  additional  work  required  in  the  adaptive  algorithm  is 
in  order.  Mainly,  the  added  expense  comes  in  changing  the  parameters.  This 
involves  the  computation  of  a and  S,  two  quotients  of  inner  products  in  the 
formula  for  iKto.v)  given  in  (4)-(5).  For  problems  of  the  type  discussed  in 

2 

Section  6,  to  compute  a and  6 requires  approximately  28J  arithmetic  operations 

2 

if  the  mesh  size  is  h = 1/J.  One  SSOR-SI  iteration  requires  approximately  39J 
operations.  Therefore,  four  parameter  changes  are  approximately  equivalent  to 
three  SSOR-SI  iterations  in  terms  of  work  performed.  Since  no  more  than  four 
parameter  changes  were  required  for  the  problems  considered,  the  number  of 
iterations  for  the  adaptive  algorithm  should  effectively  be  increased  by  about 
three  iterations. 

6.  NUMERICAL  EXAMPLES.  We  present  results  for  a sample  of  the  generalized 
Dirichlet  problems  considered.  The  results  are  given  in  graphic  form  in 
Figures  2-4.  In  each  figure,  we  give  the  differential  equation,  the  region 
considered  and  the  boundary  values.  We  replace  the  differential  equation  by  a 
S-point  symmetric  difference  equation  (see  Young  (1977)). 

The  number  of  iterations  required  for  varying  mesh  sizes  is  recorded  for 
optimum,  adaptive,  and  estimated  SSOR-SI  parameters.  In  the  adaptive  case,  the 
subscript  on  the  number  of  iterations  indicates  the  number  of  parameter  changes 
required.  The  estimated  parameters  are  the  values  of  Young  (1977)  which  depend 
on  bounds  for  the  eigenvalues  of  B and  LU.  (For  Problem  3,  the  results  for  the 
estimated  parameters  are  not  given  since  an  excessive  number  of  iterations  are 
required.)  The  slopes  s of  the  lines  indicate  that  the  number  of  iterations 

required  for  convergence  increases  like  h 


NUMBER  Of  ITERATIONS 


For  smooth  and  some  discontinuous  coefficients 


(Problems  1 and  2), 


the 


_u 

number  of  iterations  required  behaves  like  h , an  order-of-magnitude  better 
than  SOR  or  SSOR.  For  cases  involving  higher  discontinuity  (Problem  3),  the 

behavior  is  like 
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ABSTRACT 


MACSYMA  (Project  MAC  SYmbollc  MAnlpulatlon  System)  Is  used  to 
develop  a number  of  solutions  for  a sample  linear  boundary  value 
problem,  and  results  are  compared. 

A brief  outline  of  MACSYMA  capabilities  Is  given  followed  by 
a general  description  of  the  class  of  problems  treated,  and  the 
specific  boundary  value  problem  of  this  class  used  to  exemplify  the 
application  of  MACSYMA. 

A brief  overview  of  the  approach  to  solution  with  MACSYMA  and 
a MACSYMA  demonstration  of  this  approach  for  one  approximate  solution 
of  the  sample  problem  Is  given. 

Ten  approximations  to  the  solution  of  the  boundary  value  problem 
obtained  using  MACSYMA,  are  compared  with  the  true  solution  by  means 
of  MACSYMA-generated  error  curves. 
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INTRODUCTION 


MACSYMA,  Project  MAC  Symbolic  Manipulation  System  [1],  [2] 
was  developed  under  the  sponsorship  of  the  Advanced  Research 
Projects  Agency  of  the  Dept,  of  Defense  by  the  Mathlab  Group 
of  the  Laboratory  for  Computer  Science  (formerly  Project  MAC) 
at  the  Massachusetts  Institute  of  Technology  where  it  is 
installed  and  maintained.  It  is,  however,  available  to  a wide 
community  of  users  via  ARPANET,  a national  Defense  Communica- 
tions Agency  operated  computer  network.  MACSYMA  is  an  inter- 
active system  that  handles  numeric  as  well  as  symbolic 
manipulation.  Many  applications  of  MACSYMA  are  cited  in  [3]. 


More  than  ten  years 
Applied  Mathematics  Labo 
ics,  and  Logistics  Depar 
Basin  (now  the  David  W. 
opment  Center)  to  exploi 
computers  for  carrying  o 
manipulating  mathematica 
symbolic  manipulation  sy 
system  was  the  IBM  7090, 
of  boundary  and  initial 
arising  in  mathematical 


ago  a project  was  established  in  the 
ratory  (now  the  Computation,  Mathemat- 
tment)  of  the  David  Taylor  Model 
Taylor  Naval  Ship  Research  and  Devel- 
t the  combined  capabilities  of  digital 
ut  extensive  calculations  and  for 
1 operators  in  symbolic  form.  The 
stem  used  was  PORMAC  [4],  the  computer 
and  the  problem  area  was  the  solution 
value  problems,  especially  those 
physics . 


Two  general  approaches  to  the  solution  of  such  problems 

are ; 


a.  The  determination  of  fixed  points  of  operators  by 
means  of  iterative  techniques. 

b.  The  determination  of  a set  of  parameters  which 
will  minimize  in^ some  sense  the  difference  between  the  desired 
solution  and  an  approximating  function.  The  approximating 
function  will  involve  parameters  and  independent  variables. 
Collocation  methods,  least  squares  methods,  finite  difference 
methods,  methods  based  on  Taylor  series  expansion,  Galerkin's 
method,  and  many  others  can  be  viewed  as  being  of  this  type. 

Some  computer  routines  that  were  developed  to  apply  the 
second  of  the  above  techniques  to  linear  problems  are  given 
in  [5].  Tnese  routines  permitted  the  needed  equations 
to  be  stated  symbolically  as  part  of  the  input  (differential 
equations,  equations  required  to  describe  auxiliary  conditions, 
etc.),  together  with  a numerical  specification  of  the  regions 
in  which  they  were  to  be  applied.  The  symbolic  form  of  the 
approximating  function  to  be  used  could  also  be  part  of  the 
input.  Then  an  approximate  solution  function  was  generated, 
which  minimized  an  error  norm  in  the  least  squares  sense. 

This  included  collocation  as  a special  case.  The  routines 
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developed  proved  very  flexible  and  useful  for  curve  fitting 
as  well  as  for  approximating  solutions  of  problems  involving 
differential  equations  despite  the  limitations  of  the 
computers  then  in  use.  It  is  the  purpose  of  this  paper  to 
describe  more  recently  developed  MACSYMA  programs  of  the 
same  family. 

This  paper  contains,  in  addition  to  a brief  overview  of 
some  related  efforts,  a description  of  the  mathematical 
approach  used  in  the  computer  programs  plus  some  examples 
illustrating  their  performance.  The  accuracy  of  the  solutions 
obtained  is  estimated. 

A recent  review  by  Eason  [6]  on  least  squares  methods 
for  solving  problems  involving  partial  differential  equations 
contains  an  extensive  bibliography  on  this  subject.  Among 
the  conclusions  of  the  review  is:  one  of  the  "major  factors 
discouraging  more  wide  spread  use"  of  least  squares  methods 
seems  to  be  "the  presumption  that  least  squares  is  clumsy  to 
apply".  We  hope  that  our  experience  will  help  to  dispell 
that  notion. 

The  MACSYMA  System 

MACSYMA  has  been  described  as  an  automated  mathematical 
co-worker.  This  characterization  finds  justification  in  the 
fact  that  MACSYMA  can  do  many  mathematical  operations,  both 
symbolic  and  numeric,  very  rapidly  when  the  appropriate 
instructions  are  issued  by  its  user. 

Figure  1 lists  some  of  the  mathematical  capabilities  of 
MACSYMA  along  with  features  that  enhance  its  utility.  See 
the  MACSYMA  Reference  Manual  [1]  for  more  details. 

Figure  2 is  a reproduction  of  a sequence  of  instructions 
and  MACSYMA  responses  illustrating  the  symbolic  integration, 
differentiation,  and  simplification  of  a rational  expression . 
Also  included  is  the  Taylor  e:(pansion  in  X about  0 of  V 1+x 
through  the  term  containing  X . Instructions  typed  by  the 
user  are  underlined.  Note  that  % references  the  preceding 
expression. 

The  General  Problem 

The  problem  area  chosen  for  the  application  of  MACSYMA 
is  that  of  [5]  which  can  be  characterized  in  general  as  the 
solution  of 

TV-g»0  (1) 

where  V is  an  element  of  a subset  F of  a linear  space  R 
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Capabilities  of  MACSYMA 
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Interactive  capabilities 
Continuing  system  maintenance 


1 


(C9)  02; 


3 

X * 1 


(CIO)  IWTECDXTECX.X); 


SIN  FXSL  DSK  HACSTM  b«ln<  luadad 
ioadlnj  duna 


SCHATC  PASL  DSK  HACSTH  balng  loadad 
loading  dona 

2 X - 1 

2 ATAN( ) 

L0G(X  - X ♦ 1)  S0ST(3>  t.0C(I  ♦ 1) 

(010)  2 ( - . — ) 

6 S0RT(3)  3 


(C11)  DIFFd.X); 

2 2 X - 1 1 

(Oil)  2 ( — r - ) 

2 2 3 (X  ♦ 1) 

(2  X - 1)  6 (X  - X ♦ 1) 

3 

3 


(C12)  RATSIHPd); 


HAIAT  FASL  DSK  HACSTH  balng  luadad 
luadlng  dunit 

2 3 » 5 

XX  X $ X T X 

(D13)/T/  ....  , 

2 8 16  128  256 


Figure  2 


T is  an  operator  defined  in  a domain  F of  R and 
maps  F uniquely  into  R. 
g is  a specified  element  of  R. 

The  solution  is  approximated  by  a function  w of  a set  of 
parameters  ag,  aj^,...ap  so  that  some  norm  of  the  difference 
of  TV  and  TW  designated  by 

Jtv-tw)  (2) 

is  minimized. 

More  specifically,  let 

LQ(V(x),x)=g(x)  a^x^B  (3) 

represent  linear  differential  equation  with  x on  the  real 
interval  [a  , 6]  with  linear  homogeneous  boundary  conditions. 


L (V(x),x)*0atX*a 
1 


(4) 


L (V(x) , X)  = 0 at  X = B 
2 


(5) 


Assume  a solution  of  the  form 


V = Z a.f,(x) 


i=l 


1 1 


(6) 


where  the  ^ are  functions  which  may  or  may  not  satisfy 
the  boundary  conditions,  i.e. 

= L2(f^(6),6)  =0  i = 1,2, p 


Define 


ej^(a,x)  = a.f.  (x))-g(x) 


(7) 


(8) 


= Z a.L,  (f. (x))-g(x)  for  k = 0,1,2 
i=l  ^ ^ 
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The  problem  is  then  to  find  an  approximate  solution  in  the 
form  (6)  to  (3)  with  boundary  conditions  (4),  (5)  where 
the  a's  are  chosen  so  that  an  appropriate  linear  combination 
of  the  norm  of  (a»x)  in  the  appropriate  domains  will 
be  minimized. 

Solution  techniques 

Two  approaches  to  the  solution,  the  Weighted  Least 
Squares  method  and  the  Galerkin  method  were  used  and  the 
results  compared. 

The  Weighted  Least  Squares  method  can  be  expressed  as 


min 

a 


I 


a)(x)e2(a,x)dx  = min 


f P 


(x))-g(x)fo)(x)dx  (9) 


or  equivalently,  solve  for  a the  systems  of  p linear  equations 
in  the  a's  given  by 


( a)(x)e2(a,x)dx)=0  (k=l,2,. 

Joi 


•P) 


(10) 


which  reduces  to  the  system 


I a.  I w(x)L(f.(x))L(f,  (x))dx  = / u.(x)g(x)L(f,  (x))dx 

1 Ja  IK 


(11) 


(k  = 1,2 p) 

In  a similar  way  the  Galerkin  method  can  be  expressed  as 
the  system  of  equations  given  by 


jf  e(a,x)f,  (x)dx  = 0 (k=l,2, , . . ,p) 

where  the  f]^(x)  satisfy  the 

or  equivalently  boundary  conditions, 

i=l  fa  Lo(fiW)fkWdx  = g(x)f^(x)dx  (13) 

(k=l,2, p) 

Note  the  similarity  of  form  in  the  expressions  for  Weighted 
Least  Squares  and  Galerkin  methods  when  aj(x)=l. 

The  Specific  Example 

The  specific  example  chosen  as  a demonstration  problem  for 
the  class  is  the  following: 
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.dV 


2,,. 


+ 2x—  + (1-x  )’V=  1-x  for  0£  X _<  1 

dx 


(14) 


with  boundary  conditions 


g (0)  = V(l)  = 0 


(15) 


The  exact  solution  is  known  to  be 


V = 1 - e 


»s(x^-l) 


(16) 


Several  approaches  to  finding  approximate  solutions  to 
equation  (14)  subject  to  boundary  conditions  (15)  were  taken. 

An  appraisal  of  accuracy  of  each  approach  to  the  solution  of 
this  problem  was  made  by  comparing  numerical  results  with 
(16)  evaluated  at  selected  points  on  the  interval  (0,1). 

Several  variations  of  the  Least  Squares  solution  and  the 
Galerkin  solution  were  programmed  and  carried  out  in  MACSYMA 
Using  an  approximating  function  of  the  form 

V = r a.  (1-x^*^)  (17) 

i=l  ^ 

Solutions  were  obtained  for  n=4  using  the  continuous  least  squares 
and  the  continuous  Galerkin  methods  i.e.  solving  equations  (11) 
and  (13)  respectively  for  a=o,  6=1.  Solutions  using  approxi- 
mating function  (17)  with  n=4  were  also  obtained  by  the  discrete 
least  squares  method  where  the  integral  form  0 to  1 was  replaced 
by  a weighted  sum  over  the  set  of  points  0(.05).95,  first  by  a 
sum  with  uniform  weight  then  by  a sum  with  weights  of  1/2 
at  the  boundaries  so  that  a more  accurate  approximation  to  the 
integral  is  used. 

In  the  discrete  least  squares  solution  the  system  of 
linear  equations  in  the  function  coefficients,  a^,  was 
carried  out  by  the  Gram-Schmidt  orthonormalization  process 
applying  a technique  used  by  Cuthill  [5]  and  described  by 
Davis  and  Rabinowitz  [ 7]  . The  discrete  least  squares  solution 
was  also  carried  out  using  the  matrix  inversion  embodied  in 
MACSYMA  for  comparison. 

It  should  be  noted  that  the  approximating  functions  (17) 
automatically  satisfy  the  boundary  conditions  (15)  for  the 
problem. 

A second  approximating  function  of  the  form 


I*  o . . 

i=l  ^ 
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was  used  and  coefficients  were  computed  using  the  least  squares 
method.  Since  the  boundary  conditions  (15)  are  not  auto- 
matically satisfied  by  (18)  it  was  necessary  to  give  special 
attention  to  computing  the  coefficients  in  such  a way  that 
the  boundary  conditions  would  be  satisfied.  The  problem  was 
set  up  so  that  the  appropriate  boundary  equations  were  being 
fit  at  X=0  and  X=1  and  that  equation  (14)  was  being  fit  on  the 
set  of  points  .05(.05).95. 

Two  procedures  were  then  used  to  enforce  the  boundary 
conditions.  One  of  these  was  to  assign  a weight  of  10 
to  each  boundary  point  and  a weight  of  1 to  all  other  points 
and  use  a weighted  least  squares  solution. 

The  second  procedure  used  was  that  of  considering  the 
boundary  equations  with  the  approximations  substituted  at 
the  boundary  points  as  a system  of  equations  to  be  satisfied 
exactly  and  the  differential  equation  with  the  approximating 
functions  substituted  at  points  internal  to  the  interval, 
to  be  an  overdetermined  system  of  linear  equations  to  be 
solved  by  the  uniformly  weighted  least  squares  method. 

This  is  equivalent  to  a constrained  least  squares  problem. 

These  assumptions  were  effected  by  expressing  the  composite 
of  these  two  systems  as  a matrix  equation  and  partitioning 
the  matrices  appropriately.  This  led  to  two  matrix  equations 
that  were  solved  simultaneously.  The  format  used  required 
that  the  boundary  equations  be  given  first. 

Solution  Procedure  with  MACSYMA 


A general  input  procedure  for  setting  up  initial  and 
boundary  value  problems  in  MACSYMA  was  developed  and  followed 
for  the  specific  example  chosen.  Figure  3 outlines  this  pro- 
cedure. Symbols  used  in  MACSYMA  to  identify  the  various  entities 
are  given  where  applicable. 

A MACSYMA  demonstration  of  one  of  the  programs  following 
this  procedure  is  given  in  the  appendix.  Exact  rational 
arithmetic  was  used  in  these  calculations  to  demonstrate 
the  power  of  MACSYMA.  The  calculations  could  as  easily 
have  been  done  in  floating  point  arithmetic. 
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INPUT  PROCEDURE  for  LINEAR  INITIAL  and  BOUNDARY  VALUE  PROBLEMS 


Identify  the  constraining  equations 


The  solution  of  the  differential 
the  various  approximations  to  the 
aphical  form  using  the  plotting 
plots  were  obtained  on  a Tektronix 
Hard  Copy  Unit.  Figure  4 is  a 
function  and  Figures  5,  6,  and  7 
are  error  curves  for  the  approximating  functions  obtained. 

Table  1 is  a HACSYMA  printout  of  ten  polynomials  obtained 
as  approximations  to  the  solution  of  the  differential  equation. 
Table  2 identifies  each  of  the  polynomials  with  its  source  and 
gives  the  symbol  used  to  mark  the  corresponding  error  curve  in 
Figure  5,  6,  or  7. 


3 2 

+ 0.03080869't4  (1  - X ) + 0.2980095  ( 1 - X ) 


5 4 

FAPV(X)  = 0.051606234  (1  - X ) + 0.0157067273  (1  - X ) 

3 2 

0.0262761153  ( 1 - X ) + 0.30004945  ( 1 - X ) 

FLSAPV(X)  = 

5 

457272838901483990991068337593283439481040000000  (1  - X ) 


8860824 1398 15566982202625578080094691 64340969 3381 

4 

139176903304257447770480140366930693388042540000  (i  - x ) 


8860824139815566982202625578030094691643409603381 

3 

23282.5982386756901261204487764584379009033420000  ( 1 - X ) 


8860824139815566982202625573080094691643409693381 


2 

2658686008733236245532269471955192887452194692250  (l  - X ) 
8860824 139315566982202625578080094691643409693381 


5 4 

FHAPV(X)  = 0.052707125  (1  - X ) + 0.0125995092  (l  - x ) 

3 2 

+ 0.0292615863  (1  - X ) + 0.29891991  (1  - X ) 


FULSl-(X)  = - 0 . 05508089’!  X - 5.7278722E-3  X 0.035697244  x' 

2 

- 0.29656702  X + 4.7926395E-4  X + 0. 392789636 


5 43 

!><LS(X)  = - 0.055173752  X - 5.6203Y16E-3  X - 0.03595056  X 

2 

- 0.29639734  X + 4.7991139E-5  X + 0.39311364 


5 **3 

FCLS(X)  = - 0.055184639  X - 5. 60691386E-3  X - 0.035980111  X 

2 

- 0.296378005  X + 0.393149674 

5 , 4 

FSOLFT(X)  = 0.050618228  (1  - X ) + 0.015670215  (1  - X ) 

3 2 

+ 0.028779681  (1  _ X ) + 0.29837708  (1  - x ) 

5 4 . 3 

£POLFT^X)_  s - 0.053396624  X - 7.159276E-3  X 0.038121  192  X 

2 

- 0.29397657  X - 8.0931693E-!/  X + 0. 393478915 


Table  1 (ccntinued) 


Symbol  for 
Error  Function 


Approximating  Source  of  Approximating  Function 
Function 


1 

2 

3 


4 


5 


6 


7 


8 


9 

10 


FASOL(X) 

FGASOL(x) 

FAPV(x) 

FLSAPV(x) 

FHAPV(X) 


FULSI(X) 

FWLS(x) 


FCLS(x) 

FSOLFT(x) 

FPOLFT(x) 


Least  Squares,  continuous, 
with  (l-x^'*'^) 

Galerkin,  continuous,  with 

(1-xi+l  ) 

Gram-Schmidt  Least  Squares 
with  (l-x^'*'^  ) with  integral 
replaced  by  sum  over  points 
0( .05) .95 

Ordinary  Least  Squares  with 
(l-x^'*’^  ) using  rational 
arithmetic.  Integral  re- 
placed by  sum  over  points 
0( .05) .95 

Weighted  Gram-Schniidt  Least 
Squares  with  (l-x^"^^)  using 
weight  of  1/2  at  x=0  and 
weight  of  1 at  all  other 
points.  Integral  replaced 
by  sum  over  0(. 05)1.0 

Gram-Schmidt  Least  Squares 
with  (x^“^)  Integral  replaced 
by  sum  over  0(. 05)1.0 

Weighted  Gram-Sphmidt  Least 
Squares  with  (x^"^  ).  Boundary 
conditions  weighted  10,  other 
points  weighted  1.  Integral 
replaced  by  sum  over  points 
0( .05)1.0 

Gram-Schmidt  constrained  Least 
Squares  with  (x^“^  ).  Boundary 
conditions  forced.  Integral 
replaced  by  sum  over  .05(.05).95 

Polynomial  fit  of  the  exact  . 
solution,  FN(x),  using  (l-x^"*"^) 

Polynomial  fit  of  the  exact 
solution,  FN(x),  using  ( x^“^  ) 


Table  2 - Identification  of  Functions 
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APPENDIX:  MACSYMA  demonstration  of  the  solution  of 

V(,-.*).2x^  l-x* 

dx 

b;  the  continuous  Least  Squares  method  using  approximating  functions  of  the 
form  (I  - x")  for  2 < n <7 
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(C9)  FOR  1 THRU  NF  DO  FOR  J THRU  1 DO  1HT( I, J] : INTECRATE(EF{ 1 ]*EF[ J] ,X ,0 

1); 

DEFINT  FASL  DSK  HACSTM  being  loaded 
loading  done 

LINIT  FASL  DSK  MACSTH  being  loaded 
loading  done 

RESIDO  FASL  DSK  MACSTH  being  loaded 
loading  done 


SIR  FASL  DSK  MACSTH  being  loaded 
loading  done 


SCHATC  FASL  DSK  MACSTH  being  loaded 
loading  done 
T1HE>  8981  MSEC. 

(D9)  DONE 


Tcio) 

FOB 

THRU  HF 

DO  FOB  J 

THBU  1-1 

DO  INTlJ 

1):1NTII,J) 

J 

TIME* 

89 

KSEC. 

(DIO) 

DONE 

Ten) 

MINT 

CENMATRIXCINT.NF, 

NF); 

TIHE> 

21 

HSECo 

1196 

167 

2008 

223 

2020* 

853 

) 

] 

) 

) 

1 

) 

) 

315 

[ 

56 

1155 

8*0 

15015 

280 

16T 

67909 

3917 

2587901 

219*93 

72931 

56 

13860 

630 

360360 

27720 

8580 

1 

2008 

3917 

69616 

277 

8962*8 

122677 

] 

) 

1 

1155 

630 

6*35 

18 

*50*5 

50*0 

(Dll) 

223 

2587901 

277 

1093003 

18583*9 

125*5701 

) 

) 

) 

) 

1 

) 

) 

) 

) 

) 

) 

) 

[ BBO 

360360 

18 

*50*5 

55**0 

2917?0 

2020B 

- 

219*93 

~8952*8 

18593*9 

36916532 

1763651 

[ 15015 

[ 

27720 

*50*5 

55**0 

765765 

27720 

853 

[ . ... 

72931 

122677 

125*5701 

1763651 

*7*70367 

280 

8580 

50*0 

291720 

27720 

55*268 

i 
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WP 


(C12)  INC(l,J]:z 
TlHEz  I MSEC. 
(D12) 


1HTEG»ATE(EF(J)»»HS(E0),X,0,1): 

INC  :<  1HTECRATC(CF  RHS(EO), 

1.  J 1 


X,  0, 


Tc13)  MING:CENHATIIIX(INC,NF,1); 
TIME*  1170  MSEC. 


(013) 


35 

179 

120 

10« 

63 

53 

30 

( 


( 6«2e 
( — 
[ 3«65 


( «0 


(CIA)  AA:(M1NT**-1).M1NC: 

TlHEz  2798  MSEC. 

{ 1*1380031 311999339e26900A9e5979eS9 


( «663686919507035939586U2e97627209 

I 21639878065592633670893528215808 


13991060758521107818758*28692881627 

957375*69919753387681595*3776838* 


[ 13991060758521 107818758*28692881627 

(D1«)  ( 

[ 7761*519628**55097028078122*52*8 

[ *6636869195070359395861*2897627209 

1 8706*803127213285822*35755980758 


t 13991060758521107818758*28692881627 
I *6286096750200933*96768518*765** 


[ *6636869195070359395861*2897627209 

7ci5)  aan;e«(aa,numen): 

TlMEz  10  MSEC. 


(015) 


10,30315078  j 
1.5*669316E-3  ) 

I 1 

[ 0.068*2765*  ] 

I) 

0.0166*2309  ] 

) 

- 6.222888E-3  ) 

) 

( 9,92*786**E-3  ] 


1) 


) 

) 

1 

) 

) 

) 

) 

) 

) 

) 

) 

) 

) 

) 


) 

) 

) 

) 

) 

) 
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(C16)  ASOL:»A.HFM; 

T1HE>  IB  KSEC. 

7 

«62e6096750200933*9B7B8SlB>l76SA«  (1  - X ) 

(016)  — 

«663686919B07035939S861<I2897627209 

6 

87O6k8031272132B5B22938798980788  (1  - X ) 


1399 1060798921 107818758428692881627 

5 

776145196284455097028078122452«8  (1  - X ) 


4663886919507035939586142897627209 

4 

957375469919753387681595437768384  (1  - X ) 
<■  » 
13991060758521 107818758428692881627  ' 

21639878065592633670853528215808  (1  - X^). 


13991060758521107818758428692881627 

"2 

1413800313119993398289004985979855  (1  - X ) 


4663686919507035939586142897627209 


(C26)  *S0LH:EV(AS0L,RATS1MP): 
nME=  576  ICBC, 

7 

(026)  - (138858290250602800490305555429632  X 

6 5 

- 87064803127213285822435755980758  X ♦ 232843558885338529108423436735744  X 

♦ 95(375469919753387681595437768384  X * 21639878065592633670853528215808  X 

♦ 4241400939359980194867014957939565  X - 5505053333354052259995757160108375) 
/13991060758521 107818758428692881627 
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(C2T)  FOR  1:0  THRU  1 STEP  1/10  DO 

(*PP:EV(RS01.,X:I),RP:EV(RPP,NUHER), 

S0:EV(1-EXP((X>X-1)/2).X:I,NUHER), 

ER:S0-AP, 

PRIHT(*X.»,I,*APPROX**,»P,*ASOLH.*,SO,*ERROR»»,ER)); 

X>  0 APPROXi  0.393A6933  ASOLN>  0.393A693R  ERROR<  1.11T5BT1E-B 
1 

X«  — APPROX*  0.390A292T  ASOLN*  0.390R291  ERROR*  - 1.T1363353E-T 
10 
1 

Z*  - APPROX*  0.3B121639  ASOLN*  0.3B12166  ERROR*  2. 123A 1SATE-T 
•5 
3 

Z*  — APPROX*  0.36355166  ASOLN*  0.3655520A  ERROR*  3.1625A32E-T 
10 
2 

Z*  - APPROZ*  0.3R29532S  ASOLN*  0.3«29531B  ERROR*  - 1.00S62B36E-T 
1 

Z*  - APPROX*  0.31271119*  ASOLN*  0.312710725  ERROR*  - 4.693665SE-7 
2 

3 

Z*  - APPROZ*  0.273B511*  ASOLN*  0.273650955  ERROR*  - 1 .662e*515E-7 
5 
7 

Z*  --  APPROX*  0.2250631S  ASOLN*  0.2250835  ERROR*  3.*631*64E-7 
10 

4 

Z*  - APPROX*  0.164729524  ASOLN*  0.16472979  ERROR*  2.6449561E-7 

5 

9 

Z*  — APPROX:  0.090627265  ASOLN*  0.09062706  ERROR*  - 2.05B2229E-7 

10 

Z*  1 APPROX*  • ASOLN*  0 ERROR*  0 
TIHE*  190B  MSEC. 

(D27)  DONE 


(C28)  ERR:lNTECRATE(EV((LHS(E0)-RHS(EQ))**2.V:ASOLR,Dirr,RATSIHP).X,0,1); 

TIHE*  145  MSEC.  3967350112145544592*9867565*8 


Tc29)  err»*.5; 

TIHE*  'S'hJECT 

(D29) 

?lHEs  4284  HSEC. 
ID30) 


69255750754679*8370285*22202976405365 


7.0757247E-5 


batch  DONE 
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MOVING-WEIGHTED-AVERAGE  SMOOTHING  EXTENDED  TO 
THE  EXTREMITIES  OF  THE  DATA 
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ABSTRACT 

A symmetrical  moving  weighted  average  (MWA)  for  smoothing  observational  data  which 
may  be  regarded  as  equally  spaced  measurements  of  a function  of  one  variable  has  the  form 


u » y c.  y . , 

* j=-m  ^ ’‘-J 


(1) 


where  is  an  observed  value,  u^  is  the  corrcsix>nding  smoothed  value,  and  the  c^ 

are  real  coefficients  whose  sum  is  unity,  with  j ' ' This  process  does  not  yield 

smoothed  values  of  the  first  m and  the  last  m observations  unless  additional  data  arc 
available.  A natural  method  is  suggested  for  extending  the  smoothing  to  the  extremities 
of  the  data. 

If  (1)  is  exact  for  polynomials  up  to  the  degree  2h  - 1,  it  can  be  written  in  the 

form 

- tl  - (-1)®  «^^q(E)Jy^  , 


where  6 is  the  finite  difference  taken  centrally,  E Is  defined  by  Ef(x)  * f(x  t 1)  , 
and 

m-R 

q(E)  ^ I q 
j»-m*s  ^ 


for  some  coefCicients  , 
expansion 


If  q(z)  has  no  zero  on  the  unit  circle,  there  is  a Laurent 

m 

tq(z)r*  - I h. 

j“-"-  ^ 
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convergent  in  an  annulus  containing  the  unit  circle. 


We  regard  the  overall  smoothing  process  as  a matrix-vector  operation 

u = Gy  , (2) 


where  u and  y are  vectors  of  N components  and  G is  symmetric  with  rows , except 
for  the  first  m and  the  last  m,  that  merely  reflect  the  application  of  (1)  . We  de- 
termine the  first  m and  the  last  m rows  by  taking 

G = I - DK  , 


where  K is  the  matrix  of  N - s rows  and  N columns  that  transforms  a vector  into 

the  vector  of  sth  finite  differences  of  its  components,  and  D is  the  symmetric  matrix 

of  order  N - s whose  inverse  is  the  Toep] itz  matrix  T = (t.  .)  = (t.  with 

xg  i-g 

t,  . •=  h,  . . 
i-J  i-3 


The  same  vector  u can  be  obtained  by  a computational  short-cut.  Let  p(z)  be  the 
monic  polynomial  of  degree  m - s whose  zeros  are  those  zeros  of  q(z)  lying  within 
the  unit  circle,  and  let 


tU  , 

t \ t mvS,.  m r m-i 

a(z)  = (z  - 1)  p(z)  = z - 2,  a.  z . 

Then,  if  the  range  of  x is  from  A to  B,  recursively  calculate  fictitious  extended 
values  y^  for  x=A-l,  A-2,  ...,A-m  by 


^x  “j  Vj 

1=1 


Similarly,  calculate  extended  values  for  x=B+l,  B+2,  ...,B+m  recursively  by 

m 

y * y a . y 

X 3 ^x-3 

Finally,  apply  (1)  to  the  entire  sequence  of  observed  and  extended  values  to  obtain 
smoothed  values  for  x»A,  A*I>1, 

Schoenberg  (1946)  defined  the  characteristic  function  of  (1)  as 

in 


♦ (t)  = I c e 
j=-m  ^ 


ijt 


i 


1 


j 
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! and  calls  an  MWA  a smoothing  formula  if 

I 

I 


j -1  1 iKt)  < 1 , 

! 

with  some  ambiguity  as  to  whether  the  inequalities  should  be  strict  for  0 < t < 2it 

It  is  shown  here  that  the  limit  lim  g"  exists  for  all  N > 2m  if  and  only  if 

n -*•■» 

-1  £ ♦(t)  < 1 for  0 < t < 2ii 

If  0<^^(t)  < 1 for  0 < t < 2ti,  (2)  is  equivalent  to  the  minimization  of 

(u  - y)’^  (u  - y)  + (Ku)’^  HKu  , 

-1  T -1 

where  H = (D  - KK  ) is  positive  definite.  This  generalizes  the  Whittaker  (1923) 
smoothing  process. 

EXPLANATION 

The  use  of  a moving  weighted  average  of  2m  + 1 terms  to  smooth  equally  spaced  ob- 
servations of  a function  of  one  variable  does  not  yield  smoothed  values  of  the  first  m i 

I 

and  the  last  m observations,  unless  additional  data  beyond  the  range  of  the  original 
observations  are  available.  Using  Toeplitz  matrices,  Laurent  series,  and  analogies  to 
the  Whittaker  smoothing  process,  we  develop  a natural  method  of  extending  the  smoothing 
to  the  extremities  of  the  data. 

AMS(M05)  Subject  Classification  - 

Key  Words  - Smoothing,  Toeplitz  matrix,  Laurent  series.  Moving  weighted  average 
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5A3 


1.  INTRODUCTION 


A time-honored  method  of  smoothing  equally  spaced  observations  of  a function  of  one 
variable  to  remove  or  reduce  unwanted  irregularities  is  the  moving  weighted  average  , 
An  example  is  Spencer's  15-term  average  (Macaulay  1931!  Henderson  1938),  which  c«m  be  ex- 
pressed in  the  form 


u 

X 


320  " ^^x-6 


5y  c + 


3V4  + 21y^.3  + 46y^.2  + 67^_^ 


+ 74y 


+ 67y^^^  + 46y^^2  + 


2^yx«  ^ 


3y, 


x+4 


5y. 


x+5 


6y  , - 3y  _) 
x+6  ^x+7 


(1.1) 


where  y^  is  the  observed  value  corresponding  to  the  argument  x,  and  u^  is  the  cor- 
responding adjusted  value.  Actuarial  writers  commonly  refer  to  such  smoothing  of  data  as 
"graduation. " 

More  generally  (Schoenberg  1946)  a symmetrical  MWA  is  of  the  form 


m 

u = y c.  y 

X j ^ 1 X-T 

j=-m  ■’ 


(1.2) 


where  m is  a given  positive  integer  and  the  real  coefficients  c^  are  such  that  c_j  •=  c^ 
and 

m 

I ‘^j  - 1 • 


j— m 


Such  averages  have  a long  history,  that  includes  some  eminent  names,  but  the  literature 
concerning  them  is  little  known  in  the  general  mathematical  community.  Among  t'ne  early 
writers  on  the  subject  was  the  Italian  astronomer  G.  V.  Schiaparelli  (1866),  who  is  chiefly 
remembered  for  his  observations  of  the  planet  Mars.  The  majority  of  publications  in  this 
area  have  appeared  in  English  and  Scottish  actuarial  journals  starting  with  John  Finlalson 
in  1829  (see  Maclean  1913) . Probably  the  first  writer  to  make  a systematic  investigation 
of  such  averages  was  the  American  mathematician  E.  L.  De  Forest  (1873,  1875,  1876,  1877). 
His  work,  published  in  obscure  places,  was  rescued  from  total  oblivion  largely  through  the 
efforts  of  Hugh  H.  Holfenden  (1892-1968) , who  also  made  important  contributions  to  the 
subject  (Nolfenden  1925).  E.  T.  Whittaker  (1923)  suggested  an  alternative  method  of 
Sponsored  by  the  United  States  Army  under  Contract  No.  DAAG29-75-C-0024 . 
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smoothing,  which  has  been  widely  employed,  especially  by  actuaries,  and  will  be  referred  to 
extensively  later,  because  of  numerous  analogies  to  the  IWA  procedure.  The  first  writer  to 
apply  sophisticated  mathematical  tools  to  the  study  of  these  averages  was  I.  J.  Schoenberg 
(1946,  1958,  1953) , who  introduced  the  notion  of  the  characteristic  function  of  an  MWA, 
and  utilized  it  to  formulate  a criterion  for  judging  whether  a given  average  can  properly 
be  called  a “smoothing  formula."  This  criterion  will  be  discussed  in  Section  10. 

2.  THE  PROBLEM  OF  SMOOTHING  NEAR  THE  EXTREMITIES  OF  THE  DATA 

When  Ml4A's  have  been  used  by  actuaries,  the  argument  x is  usually  age  (of  a person)  in 
completed  years.  When  they  are  used  for  smoothing  economic  time  series,  x denotes  the 
position  of  a particular  observation  in  a time  sequence.  T)ie  latter  area  of  application 
appears  to  stem  largely  from  the  wor)<  of  Frederic):  R.  Macaulay  (1931)  , who  was  the  son  of 
an  actuary. 

In  eitlier  case,  a serious  disadvantage  of  the  method  is  that  it  does  not  produce  ad- 
justed values  for  arguments  too  near  the  extremities  of  the  data.  For  example,  suppose 
Spencer's  15-term  average  is  used  to  sraootli  monthly  data  extending  from  1970  through  1976. 

The  formula  does  not  give  smoottied  values  for  the  first  7 months  of  1970  or  the  last  7 
months  of  1976  unless  data  can  be  obtained  for  the  last  7 months  of  1969  and  the  first  7 
months  of  1977.  Clearly,  acquisition  of  data  extending  farther  into  the  past  is  less  of  a 
problem  than  acquisition  of  future  data. 

Actuaries  in  North  America  seem  to  have  largely  abandoned  the  use  of  MWA's  in  favor 
of  Whittalrer's  method,  which  does  not  have  the  disadvantage  described.  It  is  li):ely  that 
British  actuaries  may  still  use  these  averages  to  some  extent.  They  appear  to  be  currently 
employed  by  economic  and  demographic  statisticians  (Shiskin,  Young,  and  Musgrave  1967). 

Various  suggestions  liave  been  made  (De  Forest  1877,  Miller  1946,  Greville  1957, 

1974a)  for  dealing  with  tlie  problem  of  adjustment  of  data  near  the  extremities,  but  none 
of  them  have  won  general  acceptance.  De  Forest's  (1877,  p.  110)  suggestion  is  so  relevant 
to  the  subject  of  the  present  paper  that  it  is  worth  quoting  in  full: 

"As  the  first  m and  the  last  m terms  of  the  series  cannot  be  reached  directly  by 
the  formula  (of  2m  + 1 terms! , the  series  should  be  graphically  extended  by  m terms 
at  both  ends,  first  plotting  the  observations  on  paper  as  ordinates,  and  then  extending 


545 


the  curve  along  what  seems  to  be  its  probable  course,  and  measuring  the  ordinates  of  the 
extended  portions.  It  is  not  necessary  that  this  extension  should  coincide  with  what  would 
be  the  true  course  of  the  curve  in  those  parts.  The  important  part  is  that  the  m terms 
thus  added,  taken  together  with  the  m + 1 adjacent  given  terms,  should  follow  a curve 
whose  form  is  approximately  algebraic  and  of  a degree  not  higher  than  the  third." 

Elsewhere  (Greville  1974a)  I have  proposed  extrapolating  the  observed  data  by  fitting 
a least-squares  cubic  to  the  first  m + 1 values  and  a sijnilar  cubic  to  the  last  m t 1 
observations.  This  is  very  much  in  the  spirit  of  De  Forest's  suggestion:  it  is  not  a long 
step  from  graphic  to  algebraic  extrapolation. 

Another  approach  (Greville  1957)  regards  the  adjustment  process  as  a matrix-vector 
operation.  We  write 

u = Gy  , 

where  y is  the  vector  of  observed  values,  u is  the  corresponding  vector  of  adjusted 
values,  and  G is  a square  matrix.  If  a specified  symmetrical  MWA  of  2m  + 1 terms  is 
to  be  used  wherever  possible,  then  the  nonzero  elements  of  G,  except  for  the  first  m 
and  the  last  m rows,  are  merely  the  weights  in  the  moving  average,  these  weights  moving 
to  the  right  as  one  proceeds  down  the  rows  of  the  matrix.  In  the  first  m and  the  last 
m rows  special  unsymmctrical  weights,  determined  in  some  appropriate  manner,  must  be  in- 
serted. The  matrix  approach  and  the  extrapolation  are  not  wholly  unrelated,  since  the  final 
results  of  the  extrapolation  approach  can  be  expressed  in  matrix  form. 

It  is  the  purpose  of  the  present  paper  to  show  that  when  a given  MWA  is  being  employed, 
there  is  a natural,  preferred  method  of  extending  the  adjustment  to  the  extremities  of  the 
data,  strongly  suggested  by  the  mathematical  properties  of  the  weighted  average.  This 
natural  metho<1  of  extension  seems  to  have  eluded  previous  writers  on  the  subject,  as  indeed 
it  eluded  me  during  the  many  years  I have  thought  about  the  matter.  The  preferred  method 
of  extension  has  the  interesting  property  that  it  can  be  arrived  at  eitlier  through  the 
matrix  approach  or  the  extrapolation  approach.  In  the  latter  case,  one  must  employ  a 
special  extrapolation  formula  uniquely  determined  by  the  given  MWA.  Though  the  two  ap- 
proaches appear  to  be  quite  different,  they  will  be  shown  in  Section  9 to  be  mathematically 
equivalent,  and  they  will  give  identical  results  except  for  rounding  error. 
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In  my  own  thinking  I arrived  at  the  procedure  first  through  the  matrix  approach, 
guided  largely  by  extensive  analogies  to  the  Whittaker  process  (which  is  most  conventicntly 
expressed  in  matrix  terms).  It  was  only  later  that  I became  aware  that  identical  results 
could  be  obtained  by  means  of  an  extrapolation  algorithm.  Though  the  matrix  approach  pro- 
vides far  greater  insight  into  the  rationale  behind  the  procedure,  the  extrapolation  ap- 
proach is  simpler  computationally.  Therefore,  we  shall  first  describe  and  illustrate  tlic 
extrapolation  algorittim,  and  shall  then  motivate  and  justify  the  procedure  by  moans  of  the 
matrix  approach. 

The  extrapolation  approach  is  merely  a computational  short  cut,  and  nearly  always  1 ho 
extended  values  obtained  by  its  use  are  highly  unrealistic  if  regarded  as  extrapolated 
values  of  the  function  under  observation.  This  fact  is  irrelevant,  but  has  seriously 
"turned  off"  some  users.  Hereafter  I shall  therefore  avoid  the  use  of  the  words  "extrap- 
olate" and  "extrapolation,"  and  shall  speak  of  "extension,"  "extended  values,"  and  "inter- 
mediate values." 

It  is  emphasized  that  the  procedure  to  be  described  (or  any  other  procedure  for  com- 
pleting the  graduation)  is  recommended  for  use  only  when  additional  data  extending  beyond 
the  range  of  the  original  data  are  not  available. 

3.  THE  EXTENSION  ALOORITHM 

A weighted  average  of  the  form  (1.2)  will  bo  called  exact  for  the  degree  r if  it  has 

the  property  that,  in  case  all  the  observed  values  y . in  (1.2)  should  happen  to  be  the 

^“3 

corrcsijonding  ordinates  of  some  polynomial  P(x  - j)  of  degree  r or  less,  then 

u = y = P(x)  . (3.1) 

x x 

In  other  words,  an  average  that  is  exact  for  the  degree  r reproduces  without  cliango  poly- 
nomials of  degree  r or  less.  If  the  weights  are  symmetrical,  r must  be  odd,  and  wc  may 
write  r = 2s  - 1 . This  implies  that  r < 2m  + 1 , and  therefore  s £ m . 

For  a simple  (unweighted)  average,  r = 1 . For  the  overwliclming  majority  of  MWA's 
used  in  practice,  r = 3 . The  preference  for  cubics  has  a long  history.  De  Forest  (1873, 
p.  281)  suggests  that  "a  curve  of  the  third  degree,  which  admits  a point  of  inflexion  ... 
is  . . . better  adapted  than  the  common  parabola  to  represent  the  form  of  a scries  whose 
second  difference  changes  its  sign." 
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We  shall  use  the  notation  of  the  calculus  of  finite  differences,  wherein  E is  the 
"displacement  operator"  defined  by 

Ef(x)  = f(x  + 1)  , 

and  6 is  the  "central  difference"  operator  defined  by 

6f(x)  = f(x  + |)  - f{x  - j)  , 

so  that 

6^  f(x)  = f(x  + 1)  - 2f(x)  + f(x  - 1)  . 

If  the  weighted  average  (1.2)  is  exact  for  the  degree  2s  - 1,  it  can  bo  written  in  the 
form 

u^  = (1  - (-1)®  6^®  q(E)]y^  , (3.2) 

where  q(E)  is  of  the  form 

m~s 

q(E)  = I q e’  (3.3) 

j=-inHs  ^ 

with  q_^  = . In  a typical  smoothing  formula  q(E)  has  only  positive  coefficients,  but 

this  is  not  necessarily  the  case.  If  q(z)  is  multiplied  by  z to  eliminate  negative 
exfxjnents,  the  resulting  polynomial  is  of  degree  2m  - 2s  . Because  of  the  symmetry  of  the 
coefficients, .it  is  a reciprocal  polynomial.  In  other  words,  if  r is  a zero  of  the  poly- 
nomial, it  follows  that  r ^ is  a zero.  We  shall  ma)<e  the  assumption  that  this  polynomial 
has  no  zero  on  the  unit  circle.  If  it  does  have  such  zeros,  the  extension  of  the  smoothing 
process  to  the  extremities  of  the  data  is  undefined. 

Let  p(z)  denote  the  polynomial  of  degree  m - s with  leading  coefficient  unity 
whose  zeros  are  the  m - s zeros  of  z q(z)  located  within  the  unit  circle.  In  gen- 

eral, some  or  all  of  these  zeros  are  complex,  but  they  must  occur  in  conjugate  pairs,  so 
that  p(z)  has  real  coefficients.  Now  we  define  a polynomial  a(z)  of  degree  m and  its 

coefficients  a.  by 
3 

m 

a(z)  = (z  - 1)®  p(z)  = z"*  - 1 a.  z"'  ^ . (3.4) 

j=l  ’ 

Suppose  the  given  data  consist  of  N^B-A+1  given  values  extending  from  x = A 
to  X = B . We  assume  that  N > 2m  + 1,  so  that  at  least  one  smoothed  v'alue  is  obtained 
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by  direct  application  of  the  given  MKA.  Then  wc  obtain  m intermediate  values  to  the  left 
of  X « A by  successive  application  of  the  recurrence 

= "jVj  • 

Similarly,  m intermediate  values  to  the  right  of  x = B will  be  obtained  by  the  analogous 
recurrence 


= 


I 

j=l 


a . 
1 


x-g 


Finally,  application  of  the  symmetrical  MWA  of  2m  + 1 terms  to  the  N + 2m  observed 
and  intermediate  values  gives  adjusted  values  u^  for  x = A,  A + 1,  B . 

For  example,  Spencer's  15-term  formula  (1.1)  can  be  expressed  in  the  form  (3.2)  with 
8=2,  where 


q(E)  = (3E~^  + 18e“'‘  + 59e'^  + 137E  ^ + 242e”^  + 318  + 242E  + 137E^ 

3 4 5 

+ 59E  + 18E  + 3E  ) . 

Using  a computer  program  to  find  the  zeros  of  z^  q(z),  constructing  the  polynomial  P(z)  , 
and  finally  applying  the  formula  (3.4),  we  obtain  for  Spencer's  15-term  formula 

a(z)  = z’  - .961572Z®  - .372752z^  - .015964z'’  + .123488z^  + .125229z^ 

+ ,075887z  + .025624  . 


The  coefficients  are  rounded  to  the  nearest  sixth  decimal  place,  except  that  tlie  final 

3 2 

digits  of  the  coefficients  of  z and  z have  been  adjusted  by  one  unit  to  make  the  sum 
of  the  coefficients  exactly  zero. 

Note  that  in  the  trivial  case  s = m,  q(z)  is  a constant  and  p(z)  is  unity.  Thus 
the  algorithm  reduces  bo  extrapolation  of  the  observed  data  by  st)>  differences  (i.e.,  by 
fitting  a polynomial  of  degree  s - 1 to  the  first  s observations) . 

As  a numerical  illustration,  Spencer's  15-term  average  has  been  applied  to  some 
meteorological  data.  Table  1 and  Figure  A show  the  observed  and  graduated  values  of  monthly 
precipitation  in  Madison,  Wisconsin  in  the  years  1967-71.  No  adjustment  has  been  made  for 
the  unequal  length  of  the  months. 
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Monthly  Precipitation  (inches),  Madison,  Wisconsin,  1967-71 


Year  and  Month 

Observed 

Value 

Graduated 

Value 

Year  and  Month 

Observed 

Value 

Graduated 

Vailue 

1967  January 

1.63 

1.11 

1969  July 

4.28 

3.81 

February 

1.17 

1.63 

August 

0.96 

3.17 

March 

1.49 

2.24 

September 

1.35 

2.33 

April 

2.57 

2.88 

October 

2.65 

1.S6 

Kay 

3.53 

3.42 

November 

0.70 

1.06 

June 

6.46 

3.74 

December 

1.66 

0.82 

July 

2.51 

3.85 

1970  Janua^TT 

0.44 

0.90 

August 

2.71 

3.75 

February 

0.16 

1.25 

September 

2,68 

3.42 

March 

1.17 

1.78 

October 

5.52 

2.92 

April 

2.53 

2.39 

November 

1.83 

2.31 

May 

6.09 

2.94 

December 

1.89 

1.69 

June 

2.26 

3.37 

1968  January 

0.56 

1.31 

July 

2.42 

3.63 

February 

0.49 

1.36 

August 

0.97 

3.69 

March 

0.59 

1.6? 

September 

8.82 

3.50 

ApcU 

4.18 

2.69 

October 

2.65 

3.20 

May 

2.02 

3.49 

November 

1.06 

2.74 

June 

7.82 

3.91 

December 

2.12 

2.28 

July 

2.54 

3.92 

1971  January 

1.48 

1.94 

August 

2.58 

3.54 

February 

2.59 

1.76 

September 

4.45 

2.97 

March 

1.52 

1.74 

October 

0.85 

2.45 

April 

2.42 

1.81 

November 

1.74 

1.99 

May 

0.98 

1.93 

December 

2.89 

1.64 

June 

2.27 

2.02 

1969  January 

2.26 

1.56 

July 

1.65 

2.13 

February 

0.18 

1.81 

August 

3.96 

2.24 

March 

1.47 

2.35 

September 

1.87 

2.40 

April 

2.72 

3.13 

October 

1.30 

2.63 

May 

3.45 

3.81 

■November 

3.48 

2.84 

June 

7.96 

4.05 

December 

3.64 

3.28 

SOUBCEt  Observed  values  from  U,  S,  Department  of  Commerce,  National 
Oceanic  and  Atmospheric  Administration,  Environmental  Data  Service, 
Local  Cllr,atologlcal  Data.  Annual  Suiwnarv  with  CoRmratlve  Data. 

• 0, , 


Madison.  Wisconsin.  1972.  National  Climatic  Center,  Asheville,  H 
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For  the  convenience  of  the  user,  the  weighted-average  coefficients  and  the  inter- 
mediate-value coefficients  for  those  averages  that  appear  to  be  in  common  use  or  are  found 
in  publications  accessible  to  roe  are  given  in  the  next  section  in  Tables  2 and  3.  The 
reader  who  is  more  interested  in  the  justification  of  the  procedure  and  the  rationale  be- 
hind it  may  skip  at  once  to  Section  5. 

4.  TABU:S  OF  MOVING-AVERAGE  AND  EXTENSION  COEFFICIENTS 


Tables  2 and  3 show  t)io  coefficients  in  the  MWA  and  the  corresponding  extension  co- 
efficients (that  is,  c^  and  a^)  for  21  weighted  averages  that  have  appeared  in  the  liter- 
ature. Table  2 is  devoted  to  the  class  of  averages  known  to  actuaries  as  roinimum-Rj  formu- 
las and  to  economic  statisticians  as  "Henderson's  ideal"  formulas.  They  arc  discussed 
more  fully  in  Section  7.  The  values  in  Table  2 are  shown  to  six  decimal  places.  In  both 
instances,  a few  final  digits  have  been  adjusted  by  one  unit  to  make  the  sum  exactly  unity. 
The  moving-average  coefficients  are  given  to  the  nearest  sixth  decimal  place  except  for  the 
slight  adjustments  mentioned;  rounding  error  in  the  computation  of  the  extension  coeffici- 
ents may  have  introduced  further  small  errors  in  some  instances. 

Table  3 is  concerned  with  11  moving  averages  derived  by  various  writers  on  an  ad  hoc 
basis  and  known  by  the  names  of  their  originators.  The  source  notes  for  this  table  do  not 
attempt  to  cite  the  earliest  publication  of  the  formula  in  question,  but  merely  indicate 
a convenient  reference  where  it  can  be  found.  All  these  averages  are  exact  for  cubics  ex- 
cept Hardy's,  which  is  exact  only  for  linear  functions.  The  coefficients  in  the  averages 
of  Table  3 are  rational  fractions  with  relatively  small  denominators,  and  the  user  will 
probably  find  it  convenient  to  use  as  weights  the  integers  in  the  numerators  of  the  co- 
efficients, dividing  by  the  common  denominator  as  the  final  step.  The  column  headings, 
therefore,  are  c^  multiplied  by  the  common  denominator. 

In  both  Tables  2"  and  3 advantage  has  been  taken  of  the  symmetry  of  the  coefficients 
to  reduce  the  length  of  the  columns  by  approximately  one-half.  The  manner  of  using 
the  tables  may  be  illustrated  by  taking  Sj>cncer's  15-term  average  as  an  example.  Equation 
(1.1)  shows  the  calculation  of  tlie  moving  averages.  The  intermediate  values  y^  for 
X“A-1  to  A-7  are  calculated  successively  by  the  formula 
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= .961572y^^j  + .372752y^^2  + ' •^25229y^^^ 

- - •°25624y^^^  . 

The  intermediate  values  for  x=B+l  to  B+7  are  calculated  by  the  identical  formula 
except  that  the  "+"  signs  in  the  subscripts  are  changed  to  "-"  signs. 

The  extension  procedure  drastically  reduces  the  number  of  values  that  need  to  be 
tedjulated  for  a given  weighted  average,  and  makes  it  possible,  for  example,  to  give  com- 
plete information  about  21  such  averages  in  the  reasonably  compact  Tables  2 and  3.  How- 
ever, the  user  who  intends  to  apply  a single  weighted  average  to  many  data  sets  may  prefer 
to  tabulate  the  atypical  elements  of  the  smoothing  matrix  G for  that  weighted  average, 
and  so  avoid  the  extra  step  of  calculating  the  intermediate  values.  For  the  benefit  of 
such  users,  a metliod  of  calculating  the  atypical  rows  of  G will  now  be  described.  Justi- 
fication of  the  procedure  will  be  given  in  Section  9 (see  equation  (9.10)).  We  observe 
that  the  nonzero  elements  in  each  row  of  G except  the  first  m and  the  last  m rows 
are  merely  the  coefficients  c^  of  the  MWA  centered  about  the  diagonal  element.  The 
elements  in  the  first  ra  rows  of  G,  except  for  the  first  m columns,  follow  from  the 
symmetry  of  G,  and  if  G - have 


g. . = c . . . 

3-1 

This  leaves  only  the  square  submatrix  of  order  m in  the  upper  left  corner  to  be  calcu- 


lated. Let  c denote  the  constant  -a  /p  , whore  -p  is  the  term  free  of  z in 

^-s  m-s  *^m-s 

the  polynomial  p(z),  and  let  denote  the  square  matrix  of  order  m given  by 

I 0 for  i > j 

a . . = <1  for  i = j 


-a . , for  i < j 
3-1 


Then  the  required  submatrix  in  the  upper  left  corner  of  G is  given  by 

I-ca[a^  . 

The  similar  subraatrix  in  the  lower  right  corner  of  G contains  the  same  elements,  but 


with  the  order  of  both  rows  and  columns  reversed. 


2,  Moving-Average  Coefficients  (cj)  and  Extension 
Coefficients  (sj)  of  Mlnlmuja-R^  ("Henderson's  Ideal") 
Averages  of  5 to  23  Terns  Exact  for  Cublcs 


Number  of  Terms 

5 

7 

9 11 

13 

J 

a 

"j 

a 

“j  "j  "j 

a 

c . 

J 

"j 

i 

6 .559VtO 

.412588 

.331140 

.277944  .240058 

1 .293706  2 

.293706  1.610034 

.266557  1.352613 

.238693  I.16O8II  .214337  1.016301 

2 -.073426  -1 

.058741  -.236068 

.118470  ,114696 

.141268  .281079  .147356  .360880 

3 

-.058741  -.381966 

-.009373  -.287231 

.035723  -.140968  .065492  -.021625 

4 

-.040724  -.180078 

-,026792  -.204545  0 -.160909 

5 

-.027864  -.096377  -.027864  -.I38330 

6 

-.019350  -.056317 

^Calculated  by  fornula  (7.5). 
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2,  Monring-Averago  Coefficients  (cj)  and  E^enslon 


Coefficients  (a^)  of  Mlnlnum-R^  ("Henderson's  Ideal") 
Arerages  of  5 to  23  Terms  Exact  for  Cublcs  (continued) 


Number  of  Terms 

15 

17 

19  21  23 

i & 

J “.1.. 

".1 

a 

".1 

a 

a 

a 

a . c . 

j . 

a . 

6 .2115'*2 

.189232 

.171266 

.156470 

.144060 

1 .1937'^2 

.903661 

.176390  .813444 

.161691 

.739580  .149136 

,678000  .138313 

.625880 

2 ,145904 

.397295 

.141112  .410385 

.13'+965 

.412090  .128423 

.406495  .121949 

.397207 

3 ,082918 

.064751 

.092293  .124932 

.096658 

.166162  .097956 

.193174  ,097395 

.212501 

4 .024028 

-.100710 

.042093  -.043456 

.054685 

.005097  .063038 

.046016  .068303 

.075236 

5 -.014134 

-.135445 

.002467  -.1106';4 

.017474 

-.078255  .029628 

-.046290  .038933 

-.015313 

6 -.024499 

-.094424 

-.018640  -.106213 

-.003155 

-.099972  .003119 

-.084020  .013430 

-.063927 

7 -.013730 

-.035128 

-.020370  -.065896 

-.018972 

-.081843  -.012896 

-.084711  -.004948 

-.078737 

8 

-.009961  -.023052 

-.016601 

-.047103  -.017614 

-.063086  -.014527 

-.070064 

9 

-.007378 

-.015756  -.013455 

-.034444  -.015687 

-.048977 

10 

-.005570 

-.011134  -.010918 

-.025714 

11 

-.004278 

-.003092 

^Calculated  by  foriiula  (7,5). 


3.  Movinf'-Avorage  Coefficients  (cj)  and  Extension 
Coefficients  (sj)  of  Selected  Moving  Averages 


Macaulay* 

Spencer 

15-Term'’ 

Hoolhouse®  Hardy*' 

Hlgham®  Karup'" 

J a^ 

320c^  a^ 

125c ^ a^  I20c.  a^ 

125c  ^ a ^ 625c . a . 

P 

182 

74 

25 

24 

25 

125 

1 

171 

,919760 

67 

.961572 

24 

,885108 

22 

.739988 

24 

.859550 

114 

.820240 

2 

127 

.393023 

46 

.372752 

21 

.421982 

17 

.386211 

18 

.399283 

87 

.402924 

3 

72 

.055273 

21 

.015904 

7 

,028721 

10 

.124325 

10 

,087040 

53 

.114622 

4 

17 

-.119111 

3 

-.123488 

3 

-.076050 

4 

-.023648 

3 

-.072738 

21 

-.047133 

5 

-17 

-.140462 

-5 

-,125229 

0 

-.107285 

0 

-.08008? 

c 

-.104527 

0 

-.102491 

6 

-19 

-.084512 

-6 

-.075887 

-2 

-.092723 

-2 

-.079459 

-2 

-.093953 

-8 

-.091791 

7 

-10 

-.029971 

-3 

-,025624 

-3 

-.059753 

-2 

-.049327 

-2 

-.055312 

-9 

-.060239 

8 

-1 

-.018003 

-1 

-.019343 

-6 

-,028636 

9 

-2 

-.007496 

^Macaulay  1931«  P«  55i  footnote  2, 

'*Kacaulay  1931*  p.  55?  Henderson  1938,  p,  53. 

*^enderson  1938,  p.  53. 

’Sjenderson  1938,  p,  53?  Benjaiain  and  Haycocks  1970,  p,  238, 
®Hendorsoii  1938,  p.  53. 

^Henderson  1938,  p,  53. 
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3.  Moving-Average  Coefficients  (cj)  and  Ebctenoion 
Coefficients  (a^)  of  Selected  Moving  Averages  (continued) 


Andrews^ 

Spencer. 

21-Term 

Hardy  ^ Vaughan  . 

Wave-Cutting  Formula 

Kenchirv^ton 

i 10080c  ^ a ^ 

65c , a . I440c  , a . 

-.1 J j i_ 

385c  . a . 

- ■ J : 

0 

1688 

60 

5 

182 

45 

1 

1579 

.700747 

57 

.729724 

5 

.480996 

179 

.593256 

44 

.527740 

2 

1325 

.406808 

47 

.408707 

6 

.368708 

170 

.396409 

41 

.370688 

3 

950 

.179749 

33 

.167281 

7 

,267940 

149 

.230238 

36 

.236445 

It 

551 

.027155 

18 

.009255 

7 

.166506 

115 

.096761 

30 

.128638 

5 

225 

-.054586 

6 

-.069703 

6 

, 072964 

72 

-.000057 

22 

. 04311 8 

6 

-k 

-.083701 

-2 

-.091513- 

4 

-.008222 

29 

-.060076 

13 

-.018390 

7 

-124 

-.078256 

-5 

-.076165 

1 

-,075454 

-5 

-.083321 

5 

-.053902 

8 

-135 

-.054360 

-5 

-.CU1905I 

-1 

-.0973S7 

-26 

-.079596 

-1 

-.O67OSO 

9 

-110 

-.031120 

-3 

-.022502 

-2 

-.089039 

-29 

-.056662 

-5 

-.064844 

10 

-61 

-.012428 

-1 

-,006033 

-2 

-,062016 

-19 

-.028557 

-6 

-.050323 

11 

-1 

-.024996 

-6 

-.007595 

-5 

-.032035 

12  -3  -.015626 

13  -1  -.00Wi9 

^Andrews  and  Nesbitt  1965.  F.  10. 

^’Macaulay  1931*  P.  51 1 Henderson  1938,  p.  53. 

^Ben;iamln  and  Haycocks  1970,  p,  239. 

\aughan  1933,  P.  ^37. 

^'Henderson  1930,  p,  53. 
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5.  THE  miITTAKER  GRADUATION  PROCESS 

It  is  not  the  purpose  of  this  paper  to  consider  the  Whittaker  (1923!  see  also 
Henderson  1924)  graduation  process  in  detail.  However,  since  the  natural  method  of  ex- 
tension of  IKA  graduation  to.  the  extremities  of  the  data  was  arrived  at  primarily  on  the 
basis  of  analogies  to  the  Whittaker  method,  the  latter  must  be  described  sufficiently  to 
make  these  analogies  clear.  The  objective  of  the  Whittaker  process  is  to  choose  graduated 
values  u^  ( j = A,  A + 1,  . . . , B)  in  such  a way  as  to  minimize  the  quantity 

B B-s 

y W.(u.  - y.)  + g I (A  u.)  , (5.1) 

where  the  weiglits  W^ , the  positive  constant  g,  and  the  positive  integer  s are  chosen 

a priori  by  the  user.  Die  solution  is  most  conveniently  expressed  in  matrix  notation  as 
follows  (Greville  1957,  1974a) . Lot  W denote  the  diagonal  matrix  of  order  N whose 
successive  diagonal  elements  are  the  W^,  let  u and  y be  defined  as  in  Section  2,  and 

let  K denote  the  rectangular  matrix  of  N - s rows  and  N columns  tliat  transforms  a 
vector  v into  the  vector  of  sth  finite  differences  of  its  components.  Clearly  the  non- 
zero elements  of  K are  binomial  coefficients  of  order  s with  alternating  signs  (Greville 
1974a).  Then,  the  expression  (5.1)  can  be  written  in  the  form 

(u  - y)"*"  W(u  - y)  + g(Ku)'^  Ku  , (5.2) 

where  the  superscript  T denotes  the  transpose.  It  is  easily  seen  (Greville  1974a)  that 

(5.2)  is  smallest  when  u satisfies 

(W  + gK"^  K)u  = Wy  . (5.3) 

It  is  not  difficult  to  show  (Greville  1957,  1974a)  tliat  the  matrix  in  the  left  member  of 

(5.3)  is  norisingular  (in  fact,  positive  definite)  and  therefore 

T -1 

u = (W  H gK  K)  Wy  . 

The  remaining  discussion  will  be  limited  to  the  so-called  "Type  A"  case,  in  which  all 
the  weights  W^  are  taken  eijual  to  unity,  as  this  case  has  the  greatest  similarity  to 
MWA  graduation.  Here  W = I (the  identity),  and  it  is  easily  verified  (Noble  1969,  p.  147) 
that 
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If  the  entire  process  of  graduation,  by  whatever  method  or  criterion,  including  data 
near  the  ends,  is  conceived  in  terms  of  matrix-vector  multiplication  (Greville  1957)  , so 
that 

u = Gy  (5.5) 

for  some  matrix  G,  (5,4)  suggests  that  it  may  Toe  reasonable  to  consider  matrices  G of 
the  form 

G = I - DK  (5.6) 

for  some  square  matrix  D and  some  order  of  differences  s . 

6.  MATRIX  DEVELOPMENT  OF  THE  NATURAL  METHOD  OF  COMPIJITING  THE  GRADUATION 


We  suppose  that  N equally  spaced  observed  values  y^  (j  = A,  A + 1,  . . . , B)  are  to 
be  graduated  primarily  by  mecuis  of  a given  symmetrical  MWA  of  2m  + 1 terms  of  the  form 
(1.2),  that  is  exact  for  the  degree  2s  - 1 . We  assume  that  N > 2m  . In  other  words, 
graduated  values  u^  for  j=A+m,  A+m+l,...  B-m  will  be  calculated  from  the 
given  weiglited  average.  This  requirement  fixes  the  ‘‘lements  of  the  matrix  G of  (5.5)  and 
(5.6)  with  the  exception  of  the  first  m and  the  last  m rows.  The  nonzero  elements  of 
eanh  of  the  remaining  N - 2m  rows  will  be  merely  the  weights  in  the  moving  average  with 
the  middle  weight  on  the  diagonal  in  each  case. 

Our  determination  of  the  elements  of  the  first  m and  the  last  m rows  if  G will 
be  based  on  the  general  requirement  that  these  rows  shall  not  lae  something  extra  grafted 
onto  the  main  part  of  the  matrix,  but  shall  be  an  integral  part  of  an  overall  matrix  having 
a well  defined  structure,  this  structure  having  the  greatest  possible  analogy  to  that  of 
the  corresponding  matrix  for  the  Whitta)ccr  process.  We  shall  try  to  show  that  this  general 


requirement  leads  almost  inexorably  to  tlie  following  tliree  assumptions  about  G for  tlie 
MWA  case: 


(i)  G is  symmetric  and  of  the  form  (5.6); 


h(2)  = [q(z)]”^  = I h.  , (6.2) 

j=-  “ ’ 


convergent  in  an  annulus  containing  the  unit  circle. 

These  three  assumptions  (together  with  the  assumption  stated  in  the  first  paragraph 
of  this  section  about  the  rows  of  G other  than  the  first  m and  the  last  m)  uniquely 
determine  G . The  three  assumptions  require  extensive  discussion,  explanation,  and  com- 
ment, on  which  we  now  embarlt. 

Since  analogy  to  the  Whittaker  process  is  to  have  t)ie  highest  priority,  and  (5.4) 
shows  that  G for  that  process  is  clearly  symmetric  and  of  the  form  (5.6),  these  being 
very  basic  structural  properties,  there  can  bo  little  question  about  assumption  (i)  . This 
assumption  implies  that  G is  a diagonal  band  matrix  of  band  width  2m  + 1 , and  its  ele- 


ments are  now  determined  except  for  a square  submatrix  of  order  m in  the  upper  left 
corner  and  a similar  submatrix  in  the  lower  right  corner.  It  also  implies  that  D is 
symmetric  and  is  a diagonal  band  matrix  of  band  width  2m  - 2s  + 1 . 

It  may  be  mentioned  here  that  there  is  one  basic,  unavoidable  difference  between  the 
Whittaker  process  and  the  MWA  process.  This  is  that,  while  in  the  IMA  process  (with  the 
natural  extension)  G is  a diagonal  band  matrix,  in  the  Whittaker  process  it  is  the  in- 
verse of  such  a matrix.  In  consequence  of  this  difference,  the  Wliittaker  process  is 
"global"  (each  graduated  value  depending  on  al 1 the  observed  values) , while  MWA  is  "local" 
(each  graduated  value  depending  only  on  a few  neighboring  observed  values)  . This  distinc- 
tion carries  over  to  the  related  matrix  D,  wjiich,  in  the  W)iittaker  process,  is  not  a 
diagonal  band  matrix  but  the  inverse  of  such  a matrix  (of  band  width  2s  + 1)  : from  (5.4), 

-1  -1  T 

D=gl+KK  . (6.3) 


Assumption  (i)  fixes  the  elements  of  D except  for  those  in  a square  submatrix  of 


Y 


1 

r 


d.  . 
1-3 


(6.4) 


except  within  the  two  submatrices  mentioned. 

We  define  a ToepXitz  matrix  (see  Trench  1974)  as  one  in  which  all  the  elements  on  any 
diagonal  line  extending  downward  and  to  the  right  are  equal.  In  other  words,  T = (t  ) 

ij 

is  a Toeplitz  matrix  when 


t.  . = t.  . 
13  1-3 


for  all  i and  j . 


-1 


It  is  easily  verified  that  D for  the  Whittaker  process,  given  by  (6.3),  is  a 
Toeplitz  matrix.  In  fact,  if  D ^ = (dj^)  , 

^ ^13  ' 

where  6^^  is  a Kronecker  symbol. 

Now,  it  is  clear  that  the  Toeplitz  property  is  a very  striking  and  obvious  property 
of  those  matrices  which  possess  it.  Thus,  in  pursuit  of  our  goal  of  maximum  analogy  be- 
tween the  Whittaker  and  MWA  processes,  we  would  wish,  if  at  all  possible,  to  make  D ^ a 

Toeplitz  matrix  in  the  MWA  case.  Accordingly,  let  D ^ = (dl  .)  with  d' , “ dl  . for  all 

13  13  1-3 

i and  j . Since  D is  symmetric,  D ^ is  symmetric  and  d*„  • Consider  the 

series 


f(z)  = ][  d'.  z^ 

jra-  eo  ^ 


(6.5) 


(which  may  or  may  not  converge).  Because  of  (6.4)  this  series  is  a "reciprocal"  of  q(z) 
at  least  in  the  formal  sense  that  if  q(z)  and  f(z)  are  formally  multiplied  together, 
the  product  is  unity. 

The  latter  fact  does  not  uniquely  determine  the  series  (6.5).  In  order  to  achieve 


,1 


a unique  determination,  we  invoke  a further  analogy  between  the  MWA  and  Whittaker  processes. 


\ 


We  require  tliat  in  the  MWA  case  this  scries  converge  in  some  region  of  the  complex  plane. 
The  corresponding  series  for  the  Whittaker  case  is  finite,  and  therefore  converges  every- 
where. 


Now,  a Laurent  series  like  (6.5),  if  it  converges  anywhere,  converges  in  an  annulus. 
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circle.  Moreover,  Iq(z)]  has  a Laurent  expansion  (6.2)  convergent  in  an  annulus  con- 
taining the  unit  circle  if  and  only  if  q(z)  has  no  zero  on  the  unit  circle. 


Thus,  assumption  (iii)  is  the  only  possible  assumption  consistent  with  assumptions 
(i)  and  (ii)  that  satisfies  the  requirement  that  (6.5)  converge  in  some  part  of  the  plane, 
and  assumption  (iii)  implies  that  q(z)  has  no  zeros  on  the  unit  circle.  The  prohibition 
against  such  zeros  of  q(z)  was  previously  alluded  to  in  Section  3,  and  further  reasons 
■for  Insisting  on  it  will  l>e  given  in  Section  10. 

In  reality,  the  part  of  assumption  (ii)  that  asserts  the  nonsingularity  of  D is 
redundant,  because  it  is  shown  in  Section  9 that  if  a Toeplitz  matrix  T(=  D ^)  is  con- 
structed in  accordance  with  assumption  (iii),  then  the  square  submatrices  of  order  m - s 
in  the  upper  left  and  lower  right  corners  of  D can  be  chosen  so  that  DT  = I . 

In  the  typical  case  D is  a matrix  of  nonnegative  elements  (this  is  true  in  the 
Whitta)cer  case),  but  this  is  not  a requirement.  (It  is  not  true  of  Hardy's  formula.) 

The  matrix-vector  formulation  does  not  lead  at  once  to  a convenient  method  for  cal- 
culating the  graduated  values  near  the  ends  of  tlie  data.  It  will  be  shown  in  Section  9 to 
be  equivalent  to  the  extension  algorithm  described  in  Section  3,  and  also  to  the  method  of 
calculating  the  atypical  elements  of  G described  in  Section  4. 

7.  SPECIAL  CLASSES  OF  MOVING  AVERAGES 

Of  particular  interest  are  tliose  moving  averages  )cnown  to  actuaries  as  minimum-R^ 
formulas  and  to  economic  statisticians  as  "Henderson’s  ideal"  formulas.  For  a given  number 
of  terms  2m  + 1,  this  is  the  average  (1.2),  exact  for  the  third  degree,  for  which  the 
quantity 

I (i^  c.)^  (7.1) 

j“-m-3  ^ 

is  smallest  (with  the  understanding  that  c^  •=  0 for  |j|  > m)  . The  "smoothing  coeffi- 
cient" Rj  is  defined  as  the  quantity  obtained  by  dividing  (7.1)  by  20  and  ta)cing  the 
square  root.  The  divisor  20  is  chosen  because  this  is  the  value  of  (7.))  for  the  trivial 
case  of  (1.2)  in  v»hich  c^  " 1 and  c^  “ 0 for  J 0 . 
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The  rationale  for  minimizing  (7.1)  may  bo  explained  as  follows  (Greville  1974a).  If, 
for  some  x,  u^,  “x+2'  “x+3  given  by  (1.2),  which  is  the  case  for 

x=A+m  to  B-m-3,  inclusive,  then 

A u . = - y (A  c.)  y . , . (7.2) 

X . , 3 'x+3+3 

3=-m-3  ■' 

It  has  been  customary  to  regard  the  smallness  (in  absolute  value)  of  the  third  differences 
of  the  graduated  values  as  an  indication  of  smoothness.  Therefore  (7.2)  suggests  that 
smoothness  is  encouraged  by  ma)cing  the  quantities  A^  c^  numerically  small,  and  minimizing 
(7.1)  is  a way  of  doing  this.  The  formula  corresponding  to  (7.2)  for  a general  order  of 
differences  is 

m 

A®  u = (-1)®  y (A®  c.)  y . 

X . 3 'x+3+s  , (7.3) 

and  the  general  formula  for  is 

R^  = I (A®  c.)^/(^^®)  . (7.4) 

j=-m- s ^ 

There  is  some  question  whether  Henderson's  contribution  warrants  attaching  his  name 
to  the  "ideal"  weighted  averages.  De  Forest  (1873)  treated  extensively  the  formulas  that 
minimize  R^  . The  concept  of  choosing  the  coefficients  c^  in  order  to  minimize 
seems  to  have  been  .first  mentioned  by  G.  F.  Hardy  (1909).  These  averages  were  fully  dis- 
cussed by  Sheppard  (1913)  slightly  earlier  than  by  Henderson  (1916).  However,  Henderson 
does  seem  to  have  been  the  first  to  give  an  explicit  formula  for  the  coefficient  c^  in 
the  weighted  average  minimizing  R^  (Henderson  1916,  p.  43;  Macaulay  1931,  p.  54: 

Henderson  1938,  p.  60;  Miller  1946,  p.  71;  Greville  1974a,  p.  18).  If  we  write  )t  = m + 2, 
so  that  the  weighted  average  has  2)v  - 3 terms,  the  formula  is 

316I()c  - 1)^  - i^)  ()t^  - .i^)((k  + 1)^  - 3^)(3k^  - 16  - llj^) 

c ^ ; ^ . (7.5) 

■'  8k(k  - 1)  (4k  - 1)  (4k  - 9)  (4k  - 25) 

Weighted  averages  tliat  minimize  R^  have  been  discussed  from  other  points  of  view 
by  Wolfenden  (1925),  Schoenberg  (1946),  and  Greville  (1966,  1974b). 

Also  deserving  of  special  mention  arc  the  averages  (exact  for  rubies)  that  minimize 
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R^,  sometimes  called  "formulas  of  maximum  weight"  or  "Sheppard's  ideal"  formulas.  These 
are  sometimes  applied  to  physical  measurements  when  the  errors  of  observation  can  be  re- 
garded as  random  "white  noise"  (see  discussion  of  "reduction  of  error"  in  Section  8)  . The 
weights  are  given  by 

_ 3(3m^  + 3m  - 1)  - 15 

” (2m  - 1)  (2m  + 1)  (2m  + 3)  ‘ 

Weighting  coefficients  (c.)  and  extension  coefficients  (a.)  for  minimum-R, 

3 3 3 

(Henderson’s  ideal)  averages  of  5,  7,  ....  23  terms  are  given  in  Table  2. 

8.  COIIPARISON  WITH  Ol'HER  METHODS.  PRACTICAL  CONSIDERATIONS 

If  a symmetrical  MWA  exact  for  the  degree  2s  - 1 is  being  used  to  smooth  the  main 
part  of  the  data,  it  can  easily  be  deduced,  either  from  the  extension  algoritlut  described 
in  Section  3 or  from  the  matrix  formulation  of  (5.5)  and  (5.6)  that  the  unsymmctrical 
weightings  proposed  for  smoothing  the  first  m and  the  last  m observations  are  exact 
only  for  the  degree  s - 1 . For  example,  all  the  averages  represented  in  Tables  2 and  3 
with  the  exception  of  Hardy's  are  exact  for  cubics,  and  therefore  their  extensions  to 
values  near  the  ends  are  exact  only  for  linear  functions.  Hardy's  weighted  average  is  ex- 
act for  linear  functions  and  its  extension  only  for  constants. 

The  Whittaker  process  has  a similar  property.  At  a sufficient  distance  from  the 
ends  of  the  data,  polynomials  of  degree  2s  - 1 are  "almost"  reproduced  by  that  process. 

In  support  of  this  rather  loose  statement  the  following  heuristic  argument  is  advanced. 

For  the  Whittaker  process 

G = (I  + gk’^  K)”^  = I - gGk’^  K . 

Thus,  if  y is  the  vector  of  observed  values,  the  vector  of  corrections  to  these  values  is 

T 

- gGK  Ky  . 

T 

Now,  the  nonzero  elements  of  K K,  with  the  exception  of  the  first  s and  the  last  s 

rows,  are  binomial  coefficients  of  order  2s  with  alternating  signs.  Therefore  the  com- 
T 

ponents  of  K Ky,  except  for  the  first  s and  the  last  s,  are  (2s)th  differences  of 
these  of  y (or  their  negatives  if  s is  odd).  Thus,  if  y is  a vector  of  ordinates  of 
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a polynomial  of  degree  2s  - 1,  K Ky  is  a vector  of  zeros  except  for  the  first-  s and 

T 

the  last  s components.  The  components  of  GK  Ky  are  graduated  values  of  those  of 
T 

K Ky,  and  therefore  should  be  very  small  at  some  distance  from  the  extremities  of  the 
data.  Finally,  multiplication  by  g,  even  though  g is  typically  large,  should  give 
small  corrections  at  a sufficient  distance  from  the  ends  of  the  data. 

Some  users  may  consider  the  reduction  in  degree  of  exactness  near  the  ends  of  the  data 
a disadvantage  of  the  natural  method  of  extension.  Before  I became  aware  of  the  natural 
method,  I had  proposed  (Greville  1974a)  a different  method  of  extension  (already  mentioned 
in  Section  2)  that  does  not  have  this  particular  disadvantage  (though  it  has  other  short- 
comings) . This  involves  extrapolation  by  a polynomial  of  degree  2s  - 1 fitted  by  least 
squares  to  the  first  m + 1 observations.  A similar  ixslynomial  is  fitted  to  the  last 
ro  + 1 observations  for  extrapolation  at  the  other  end  of  the  data.  There  may  be  a gain  in 
simplicity  in  using  a single  method  of  extrapolation  for  all  symmetrical  weighted  averages, 
the  particular  extrapolated  values  depending  only  on  the  number  of  terms  in  the  main  for- 
mula. However,  there  is  a loss  in  that  the  extension  method  is  no  longer  tailored  to  the 
particular  symmetrical  average  used. 

Li)ce  the  natural  method  of  extension,  the  method  using  extrapolation  by  least  squares 
can  be  collapsed  into  a single  matrix  G . Wlien  this  is  done,  the  diagonal  band  character 
of  the  smoothing  matrix  is  maintained,  but  the  symmetry  is  lost.  Though  the  matrix  ap- 
proach is  less  convenient  for  computational  purposes,  the  differences  Ixstween  the  two 
methods  are  best  elucidated  by  comparing  the  first  m rows  of  the  respective  matrices  G . 
This  is  done  in  Tables  4 and  5 for  the  case  of  the  9-term  "ideal"  formula.  Here  m = 4 , 
but  for  convenience  the  fifth  row  is  also  shown.  Its  elements  would  bo  repeated  in  the 
subsequent  rows,  moving  successively  to  the  right,  until  we  come  to  the  last  four  rows. 

While  an  average  of  as  few  as  9 terms  would  seldom  be  used  in  practice,  this  is  a con- 
venient illustration. 

As  previously  indicated,  the  first  m rows  and  the  last  m rows  of  G may  l>e  re- 
garded as  exhibiting  unsymmetrical  weighted  averages  which  are  to  t)e  used  near  the  ends  of 
the  data  to  supplement  the  symmetrical  average  used  elsewhere.  The  coefficients  that  ap- 
pear in  the  last  m rows  are  the  same  as  those  in  the  first  m rows,  but  the  order  is 
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i 

i 
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reversed,  both  horizontally  and  vertically.  It  should  be  noted  that  the  coefficients  in 

the  supplemental  averages  depend  only  on  those  of  the  underlying  symmetrical  average.  They  i 

do  not  depend  on  N,  the  number  of  observations  in  the  data  set  (which  is  the  order  of  G) . 

The  coefficients  in  the  supplemental  weighted  averages  based  on  least-square  extrapo- 
lation, exhibited  in  Table  5,  show  two  undesireible  features.  These  are  negative  coeffi- 
cients of  substantial  numerical  magnitude,  and  successive  waves  of  positive  and  negative 
coefficients  as  one  proceeds  from  left  to  right  along  the  rows.  The  number  of  such  waves 
would  increase  as  the  number  of  terms  in  the  underlying  formula  increases. 

In  striking  contrast  is  the  character  of  the  coefficients  of  the  natural  extension. 

Like  the  coefficients  in  the  underlying  symmetrical  formula,  eacli  row  exhibits  a peak  in 
the  vicinity  of  the  main  diagonal  of  the  matrix,  tapering  off  to  a single  group  of  nega- 
tive coefficients  of  reduced  size  near  the  edge  of  the  diagonal  band. 

In  the  least-squares  method  only  a very  small  correction  is  made  to  the  initial  ob- 
served value.  The  cori'esponding  correction  in  the  natural  method  is  more  substantial. 

The  "second-difference  correction"  is  the  coefficient  of  the  second-difference  term 
when  the  formula  is  expressed  in  terms  of  increasing  orders  of  differences  in  the  form 


c A y 


x-h 


The  coefficient  c does  not  depend  on  the  subscript  x-h,  in  which  there  is  some  free- 
dom of  choice.  Fo.'  the  formulas  based  on  least-squares  extrapolation,  which  are  exact  for 
cubics,  the  fourth-difference  correction  is  similarly  defined. 

Some  writers  (Miller  1946,  Wolfenden  1942,  Greville  1974a)  h.ivo  regarded  the  ob- 
served values  y as  the  sum  of  "true"  values  U and  superimposed  random  errors  e 
x X X 

If  it  is  assumed  that  the  errors  e for  different  x are  uncorrelated,  and  have  zero 

X 

2 

mean  and  constant  variance  a for  all  x,  then  the  variance  of  the  error  in  the  smoothed 
2 2 2 

value  is  o , where  is  obtained  by  taking  s = 0 in  (7.4).  Thus,  R^  may 

be  interpreted  as  the  ratio  of  reduction  in  the  standard  deviation  of  error  that  results 
from  application  of  the  weighted  average. 

While  the  assumptions  underlying  the  preceding  analysis  may  be  questioned,  never- 
theless a good  case  can  be  made  that,  for  any  weighted  average,  R should  be  less  than 
unity.  Since  R^  is  the  sura  of  the  squares  of  the  coefficients  in  the  average,  Rq  can 
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age  cannot  be  considered  satisfactory  if  the  absolute  value  of  any  coefficient  is  equal  to 
or  greater  than  unity. 

As  indicated  in  Section  7,  it  has  long  been  customary  to  regard  a graduation  as  smooth 


if  the  third  differences  of  the  graduated  values  are  small  in  absolute  value, 
we  have 


If  G = (g^^)  , 


N 

Vi-1  “ . 

:=i 

and  therefore 


'^A+i-1  ^ ^A+j-1  *i  ^ij 


where  the  subscript  of  A indicates  that  the  differences  are  taken  with  respect  to  i 


(i.e.,  down  the  columns  of  the  matrix).  If  one  avoids  the  corner  submatrices,  the  nonzero 
elements  g^^  in  (8.1)  are  merely  coefficients  in  the . underlying  symmetrical  average,  and 
(8.1)  reduces  to  (7.3).  This  was  the  rationale  underlying  the  derivation  of  the  minimum- 


averages. 

Of  course,  if  G is  symmetric,  it  makes  no  difference  whether  the  differences  are 
taken  horizontally  or  vertically,  vnien  the  symmetry  of  G is  not  assumed,  care  must  be 
exercised.  Many  years  ago  (Greville  1947,  1948)  I published  what  purported  to  be  coeffi- 
cients in  supplemental  averages  to  be  used  near  the  ends  of  the  data  in  conjunction  with 
minimum-R^  and  minimum-R^  symmetrical  averages.  The  symmetry  of  G was  not  assumed,  and 
I made  the  error  of  deriving  the  unsymmetrical  coefficients  by  minimizing  their  third  dif- 


ferences taken  horizontally.  The  tables  in  question  are  therefore  based  on  an  incorrect 


assumption.  Further  it  may  be  mentioned  in  passing  that  in  the  1947-8  formulation  the 
diagonal  band  character  was  not  maintained,  since  the  supplemental  averages  contained  the 
full  2m  > 1 terms. 

Table  6 shows,  for  the  natural  and  the  least-squares  extensions  of  the  9-term  mini- 
mum-Rj  formula,  those  third  differences  of  the  matrix  elements,  taken  vertically,  that 
involve  elements  of  the  first  five  rows.  The  entries  in  the  fifth  row  of  Table  6 would 


1 


be  repeated  in  subsequent  rows,  moving  successively  to  the  right.  Casual  inspection  of 
the  table  shows  that  the  third  differences  are  numerically  smaller  for  the  natural  exten- 
sion. All  of  these  third  differences  are  less  than  0.14  in  absolute  value.  Two  of  those 
for  the  least-squares  extension  exceed  0.7  in  absolute  value. 

It  is  instructive  to  compare  the  natural  extension  with  the  least-squares  extension 
for  the  numerical  example  of  Section  3.  Though  neither  extension  is  recommended  for  use 
when  additional  data  are  available  beyond  the  range  of  the  original  data  set,  nevertlieless 
it  may  be  of  interest,  purely  for  purposes  of  illustration,  to  choose  a numerical  example 
in  which  such  additional  data  are  available,  and  this  has  been  done. 

Table  7 and  Figures  B and  C show,  for  the  first  seven  months  of  1967  and  the  last 

seven  months  of  1971,  the  observed  values  of  precipitation  in  Madison,  Wisconsin,  and  the 
graduated  values  obtained  by  <i)  natural  extension  of  Spencer's  15-term  average,  (ii) 
least-squares  extension  of  the  same  average,  and  (iii)  use  of  additional  data.  It  will  be 
noted  that  the  least-squares  extension  is  strongly  constrained  toward  each  of  the  two 
terminal  observations  (January  1967  and  December  1971) , Tliis  may  be  explained  by  the  fact 
that  all  the  values  y^  in  11.2)  that  enter  into  the  calculation  of  these  graduated  values 

are  included  in  either  the  m + 1 observations  to  which  the  least-squares  cubic  was  fitted 

or  the  m extrapolated  values  obtained  from  the  same  cubic.  On  the  other  hand,  the  natural 
extension  and  the  least-squares  extension  are  very  close  together  at  the  interface  with  the 
graduated  values  calculated  in  the  standard  manner.  Thus,  for  the  months  of  July  1967  and 
June  1971,  all  but  one  of  the  values  y^  entering  into  the  computation  (1.2)  are  identical 
for  the  two  methods. 

For  the  months  closer  to  the  interface  the  graduated  values  obtained  by  introducing 
additional  data  are  close  to  those  of  the  natural  extension.  This  is  because  the  supple- 
mental unsymmetrical  averages  produced  by  the  natural  extension  (unlike  those  of  the  least- 
squares  extension)  give  relatively  small  weight  to  the  observations  more  remote  from  the 
one  being  graduated  (as  does  the  underlying  symmetrical  formula).  For  example,  the  values 
for  the  natural  extension  and  those  obtained  by  the  use  of  additional  data  are  indistin- 

I 

gulshablc  in  Figure  B for  April  to  July  1967.  in  the  last  months  of  1971  the  deviation  j 

is  greater  because  the  first  two  months  of  1972  were  exceptionally  dry.  This  could  not  j 

have  been  predicted  from  the  data  for  preced 


7.  Extension  of  15-Terni  Spencer  Graduation  of  Madison  Precipitation 
Data  to  First  Seven  and  Last  Seven  Months  by  Different  Methods 


Extension 

of  Graduation  by 

Year  and 
Month 

Obcerved 

Value 

Natural 

Method 

Least-Squares 

Cubic 

Additional 

Data 

1$67 

January 

1.63 

1.11 

1.62 

1.56 

February 

1.17 

1.63 

0.98 

1.84 

March 

1.49 

2.24 

1.37 

2.29 

April 

2.57 

2.88 

2.32 

2.85 

May 

3.53 

3.42 

3.07 

3.36 

June 

6.46 

3.74 

3.61 

3.70 

July 

2.51 

3.85 

3.82 

3.84 

1971 

June 

2.27 

2.02 

2.00 

2.05 

July 

1.65 

2.13 

2.03 

2.23 

August 

3.96 

2.24 

2.00 

2.39 

September 

1.87 

2.40 

1.97 

2.51 

October 

1.30 

2.63 

2.08 

2.50 

November 

3.48 

2.84 

2.58 

2.31 

December 

3.64 

3.28 

3.85 

2.04 
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C.  Observed  and  Graduated  Values  of  Monthly 
Precipitation,  Madison,  Wisconsin,  July  to  December,  1971 


Table  8 gives  certain  parameters  for  the  various  symmetrical  weighted  averages  that 
have  been  mentioned  previously.  The  column  headed  "Error"  requires  explanation.  This  is 


I 


1 

I 

i 


the  error  committed  when  the  formula  in  question  is  used  to  "smooth"  a polynomial  of  degree 
four.  This  naturally  tends  to  increase  with  the  number  of  terms  in  the  formula.  Both 
and  R^  tend  to  decrease  with  increasing  number  of  terms.  Though  the  "ideal"  formulas 
have  been  derived  to  minimize  R^,  they  tend  to  produce  small  values  of  R^  as  well.  In 
only  one  instance  (Vaughan)  does  a "name"  formula  have  a smaller  R^  than  the  ideal  formula 
of  the  same  nuntber  of  terms.  The  late  Hubert  Vaughan  was  an  unusually  )<oen  analvst  of  MWA 
smoothing. 

It  may  be  mentioned  in  passing  that  some  writers  (e.  g. , Henderson  1938)  call  the  re- 
2 

ciprocal  of  R^  the  "weight"  and  the  reciprocal  of  R^  the  (smoothing)  "power." 

9.  PROOF  OF  EQUIVALENCE  OF  THE  MATRIX  AND  INTERMEDIATE- VALUE  APPROACHES 

Though  this  proof  involves  only  elementary  mathematics,  it  is  fairly  long  and  comp- 
licated, and  is  therefore  organized  in  the  form  of  three  lemmas  and  a theorem. 

Let 

m-s 

, , m-s  v m-s-3 

p(z)  = z - Z.  P^  z 

j=l  3 

where  p(z)  is  the  polynomial  defined  in  Section  3 whose  zeros  are  the  zeros  of  q(z) 
located  inside  the  y-nit  circle. 


Leima  9.1. 


The  quantities 


of  (6.2)  satisfy  the  recurrence 


h. 

3 


n-5 

^ Pf  ’’j- 
1=1  J 


for  all  positive  j . 

Proof.  In  an  annular  region  containing  the  unit  circle,  we  have 

h(z)  q(z)  = 1 . 

But,  for  a suitable  (nonvanisliing)  constant  c , 

q(z)  = cp(z)  p(z  ^)  . 


(9.1) 


(9.2) 


In  fact,  c = -o  /p  . Therefore, 
Tn-E  m-s 


1 


i 


i 

I 
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8,  Parametere  of  the  Symnetrlcal  Weighted  Averages  Listed  In  Tables  2 and  3 


Designation 

Number 
of  Torns 

R 

0 

"3 

Error 

Kiniiitun-R^  (Henderson's  ideal) i 

5 

.7045 

.2735 

4 

-.0736 

7 

.5971 

.114? 

-.296*^ 

9 

.5323 

.0581 

-.766'* 

11 

.4865 

.0331 

4 

-1.576 

13 

.'^515 

.0204 

-2.886** 

15 

.4234 

.0134 

-4.656** 

17 

.4002 

.0095 

-7.646** 

19 

.3806 

.0066 

-ll.iid** 

21 

.3636 

.0048 

4 

-16.56^ 

23 

.3488 

.0036 

-23.16** 

I’^caulay 

15 

.4273 

.01657 

-4.526** 

Spencer 

15 

.4389 

.01659 

-3.666** 

Voolhouse 

15 

.4602 

.0654 

-5.46** 

HKrdy 

17 

.4059 

0 

0 

• 

~ 6^  - 3.706** 

Hlghaa 

17 

.4127 

.0179 

-6.46** 

Karup 

19 

.4036 

.0095 

-7.86** 

Andrews 

21 

.3707 

,00628 

-14.96** 

Spencer 

21 

.3784 

.00626 

-12.66** 

Hardy,  wave-cutting 

23 

.3332 

.0154 

-48.86** 

Vaughan  A 

23 

.3*^15 

.0050 

-26.66** 

Xenchlngton 

27 

,3202 

,0031 

-22.46** 
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(9.3) 


cp(2)  p(z  ) h(z)  » 1 . 


-1 


Now,  Ip(z)l  has  an  expansion  in  negative  powers  of  z,  with  exponents  not  greater 
than  -m  + s,  whose  region  of  convergence  contains  the  unit  circle.  Call  it  b(2)  . Then, 


-1 


cp(z  ) h(z)  = b(z)  , 


from  which  it  follows  that  (9.1)  holds  for  all  positive  j,  and  the  proof  is  complete. 
Let  denote  the  (unknown)  square  subnatrix  of  order  m - s in  the  upper  left 


corner  of  D . Let  P » (P^^j)  b®  * matrix  of  m - s rows  and  2m  - 2s  columns  defined 


by 


for  i > j 
for  i = j 


-p.  . for  0<j-i<m-s 
1-1  - 


for  j - i > m - s . 


Lot  P t)e  partitioned  in  the  form  (P^^  **2^'  ''here  P^^  and  P^  are  square.  Let 


T » denote  the  Toeplitz  matrix  of  order  N - s defined  by 


’'ij  ° ’’i-j  • 


-1 


In  other  words,  T is  D under  assumption  (iii)  of  Section  6. 


Leirnia  9.2.  T is  nonsingular  and  equal  to  D ^ if  D is  completed  by  assigning 


Dii-cpJPl  , 


together  with  a corresponding  assignment  of  the  square  submatrix  of  order  m - s in  the 
lower  right  corner  of  D . 


Proof.  Note  that  if  we  try  to  form  the  product  DT,  all  elements  of  the  product 
that  do  not  involve  the  missing  elements  in  the  corners  of  D have  the  correct  values 
( 0 or  1)  . We  shall  focus  on  the  upper  left  corner;  similar  considerations  apply  to  the 
lower  right  corner.  Hie  lemma  will  be  proved  if  it  can  be  shown  that  the  product  DT  is 
indeed  the  Identity  if  (and  its  counterpart  at  the  lower  right)  is  chosen  in  the 

manner  indicated. 


Djj  denote  the  squ.-tro  subroatrix  of  D of  order  m - s immediately  to  the 


right  of  let  be  the  submatrix  corresponding  to  in  the  upper  left  corner 

of  T,  and  o"®  immediately  to  its  right.  By  symmetry  the  square  submatrix  of 

T 

order  m - s immediately  below  is  • The  product  DT  will  be  the  identity  if 

is  such  that 

D,,  T 4 D = I (9.4) 

11  11  12  12 

(and  if  a similar  relation  holds  in  the  lower  right  corner) . 

It  is  easily  verified  that 

Di2  = cbJ"  . (9.5) 

Let  ^ square  matrix  of  order  ra  - s defined  by 

d.  . = q.  . 

13  1-3 

The  reader  will  note  that  replacement  of  by  (and  a corresponding  replacement 

in  the  lower  right  corner)  would  make  D a Toeplitz  matrix.  It  follows  from  the  definition 
of  h(2)  in  (6. 2)  and  the  Toeplitz  character  of  the  matrices  involved  that 

■^12  ^ ^11  ^11  " '’12  '^12  = ^- 

(The  reader  may  think  of  the  block  immediately  below  as  moved  up  to  the  left  of  , 

eind  the  block  Tj^2  moved  to  a position  immediately  above  T^^^  .)  It  is  clear  from  (9.6) 
that  (9.4)  will  bf  satisfied  if 

T 

D T 4D  T =D  T . (9.7) 

12  ^12  11  11  11  11  ' ' 


It  is  easily  verified  that 


*^11  = "’I 


:■[':]  ■ 


T T 

p p + p p 
*^2  2 11 


Substitution  of  this  resuDt  and  (9.5)  in  the  left  member  of  (9.7)  gives 


"‘'’2  ”1  ’'12 " '’I  '’2  ^1 " '’i  "^ii’  • 


P T 4 P T = P 
1 12  2 11 


577 


by  Lemma  9.1.  Thus  (9.8)  reduces  to  cP^  p T , and  (9.7)  is  satisfied  if  D - ep’’ 

1 X IX  II  1 *^1 

This  completes  the  proof, 

T 

Let  L denote  the  m by  N suhmatrix  of  I - G » K DK  consisting  of  the  first  m 
rows,  and  let  A = (a^^j)  be  the  m by  N matrix  defined  by 


0 
1 


for  i > 3 
for  i = j 


’ij 


-a . . for  0 < j - i < m 

3-1 

0 for  j - i > m , 


(9.9) 


where  the  coefficients  a^  were  defined  in  (3.4).  Let  denote  the  square  submatrix 

of  A consisting  of  the  first  m columns. 


Lemma  9.3. 


L = cAj  A 


(9.10) 


Proof.  Let  denote  the  submatrix  of  D consisting  of  the  first  m rows,  and 

let  denote  the  square  submatrix  of  order  m in  the  upper  left  corner  of  K . Then 

T 

It  follows  from  the  placement  of  zeros  in  K that 

L=K^^D^K  . 

Let  P denote  an  m by  N - s matrix  with  the  elements  defined  as  in  P (following  the 
proof  of  Lemma  9.1).  It  is  easily  verified  that 

A = PK  . 

A A 

Let  Pj  denote  the  square  submatrix  of  P consisting  of  the  first  m columns.  Then 

T T 

A^  “ ^ . 


Thus, 


cA^  A = cK^j  PK  . 

aT  a 

But  it  follows  from  the  proof  of  Lemma  9.2  that  P * and  so 

CAi  A . Dj  K = L , 
as  required  for  the  proof  of  the  lemma. 
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Theorem  9.1.  The  extension  method  of  Section  3 and  the  matrix  formulation  of  Section 
6 are  equivalent. 

Proof.  Lot  denote  the  subroatrix  of  A consisting  of  the  (m  + ))th  to  (2m) th 

columns  and  let 

A = [Aj^  A^l  . 

Let  denote  the  vector  of  the  m intermediate  values  obtained  from  the  observations 

by  (3.5),  let  and  respectively,  denote  the  vectors  of  the  first  m obser- 

vatiorvs  and  the  (n\  + l)th  to  (2nv)th  observations!  and  let  y denote  the  vector  con~ 
sisting  of  y^°'  followed  by  y^^'  . Then,  the  extension  method  requires  Ay  = 0,  or 

\ = -^2 


Let  G = (g^.)  be  the  square  matrix  of.  order  m defined  by  g^^  = c._^  (where  the 

coefficients  c.  were  defined  in  (1.2)),  and  let  G.  be  the  submatrix  of  G formed  from 

3 

the  first  m rows  and  the  (m  + l)th  to  {2m)  th  columns.  Then  the  vector  of  tlie  first 
m graduated  values  from  the  matrix  formulation  is,  by  Lemma  9.3, 


y<^>  - CA^(A^  y^^>  t A,  . 


By  the  extension  method,  the  corresponding  vector  is 

0^,  . o v‘'>  . 0,2  >■"’ 


But,  since  G « I - K DK  , 


G * I - c(A, 


[y  - ^ "i 


Gi2  -CA^  A^ 


Thus,  (9.13)  reduces  to 


y‘^>  - CA^ 


(0)  T (1)  ,T  ^ (1)  ^ -T  „(2) 

Aj^  y + Aj^  Aj^  y + A^  A^  y + A^  A^  y J 


The  substitution  (9.11)  rcduco.s  this  to  (9.12),  as  required. 


We  note  in  passing  that  the  computational  short  cut  involving  extended  values  has  an 
analogue  in  the  case  of  Whittaker  smoothing.  Especially  in  actuarial  literature,  the 


Whittaker  smoothing  process  is  sometimes  called  the  difference-equation  method  because  the 
difference  equation 

u + (-1)^  6^®  u = y (9.14) 

X X X 

holds  for  x=A+s,  A+s+1,  ...,B-s.  It  was  pointed  out  by  Aitken  (1926)  that 
(9.14)  is  satisfied  for  x = A,  A + 1,  ...,  B if  we  introduce  at  each  end  of  the  data  set 


E extrapolated  values  of  both 

and 

u 

X 

satisfying 

t)ie  conditions 

“x  = ^x 

(X 

= A 

- h 

X = B + j: 

j = 1. 

2 

s) 

A®  u = 0 

Y. 

(X 

= A 

- j* 

X = B - j; 

j = 1. 

2,  .... 

s) 

However,  tliis  observation  is  not  helpful  from  a computational  point  of  view.  The  attempt 
to  utilize  it  merely  increases  tlic  order  of  the  linear  system  to  be  solved  from  N to 
N + 2s  . 

10.  THE  CHAKACTERISTIC  FUNCTION  AND  SCHOENBERG'S 
DEFINITION  OF  A SMOOTHING  FORMULA 

Schoenberg  (1946)  defined  the  characteristic  function  of  the  MWA  (1.2)  as 

m 

4>(t)  = I c.  e’’  . (10.1) 

j=-m  ^ 

For  a syiiimotrical  MWA  this  is  a real  function  of  the  real  variable  t,  and  can  be  ex- 
pressed in  the  alternative  form 

m 

ijv(t)  - c.  cos  jt 

j=-m  ^ 

It  is  periodic  with  period  2ii  and  equal  to  unity  for  t = 2iin  for  all  integers  n . 

The  effect  of  MWA’s  in  eliminating  or  reducing  certain  waves  has  been  noted 

(Elphinstone  1961,  Hannan  1970).  If  the  input  to  tlie  .smoothing  process  is  a sine  wave, 

which  may  be  represented  in  the  form 

y “ C cos(rx  + h)  , (10.2) 

x 
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) 

i 


it  can  be  shown  by  simple  algebraic  manipulation  that 


u = y 4)(2ii/P)  , 

X X 

where  P = 2'n/r  is  the  period  of  y^  . Thus,  if  ^(2v/P)  = 0,  the  wave  is  annihilated 
by  the  smoothing  process;  the  amplitude  is  severely  reduced  if  it  is  close  to  zero.  Thus 
MWA  smoothing  is  related  to  the  "filtering"  processes  considered  by  Wiener  (1949)  and 
others. 

Schoenberg  (1946)  defined  a smoothing  formula  as  an  MWA  whose  characteristic  function 
♦(t)  satisfies  the  condition 

U(t)  I 1 1 (10.4) 

for  all  t , Later  (Schoenberg  1940,  1953)  he  suggested  the  stronger  condition 

[♦(t)  1 < 1 (0  < t < 2ii)  . (10.5) 

C.  Lanczos  (see  Schoenberg  1953)  pointed  out  that  condition  (10.4)  is  obtained  by  requiring 
that  every  simple  vibration  (10.2)  be  diminished  in  amplitude  by  the  transformation  (1.2). 
The  results  of  Section  6 of  the  present  pai^cr  suggest  an  alternative  definition  of  a smooth- 
ing formula.  Using  the  subscript  N to  emphasize  the  fact  that  the  order  of  G is  the 
number  of  observations  in  the  data  set,  we  may  say  that  (1.2)  is  a smoothing  formula  if 

g“  = lim  g"  (10.6) 

Tf*  ” 

exists  for  all  N.  Schoenberg  (1953,  footnote  3)  suggested  a relationship  between  (10.4) 
and  the  conditions  for  existence  of  the  infinite  power  of  a matrix  (Oldenburger  1940, 

Dresden  1942),  but  he  did  not  cla)x>rate  the  connection.  We  shall  show  that  the  existence 
of  the  limit  (10.6)  for  all  N is  equivalent  to  a condition  intermediate  between  (10.4) 
and  (10.5).  The  following  leirana  will  help  to  elucidate  the  situation. 


Lemma  10.1.  For  a given  t in  (0,  2;;)  , i(i(t)  =1  if  and  only  if  q(T)  = 0 . 
Proof.  Prom  (3.1).  (3.2).  and  (10.1)  it  follows  that 


There  are  two  ways  in  which  equality  can  hold  in  (10.4),  namely  i)i(t)  = 1 . and 
if(t)  = -1,  and  the  situation  is  different  in  the  two  cases.  Lenuna  10,1  shows  that  if 
♦ (t)  = 1,  q(z)  has  a zero  on  the  unit  circle  and  consequently  is  not  defined.  No 

such  problem  arises  if  4’(t)  - ”1  . We  are  therefore  led  to  the  intermediate  condition 

-1  £ ((>(t)  <1  (0  < t < 2ii)  , (10.8) 

which  we  shall  show  to  be  equivalent  to  the  existence  of  (10.6) . 

Lemma  10.2.  If  (10.8)  holds,  D is  positive  definite. 

Proof . From  (9.2)  we  obtain 

q(l)  = clp(l)]^  . 

It  follows  from  (10.7)  and  (10.8)  that  q(e^'')  is  positive  for  0 < t < 27r  . Since  it  is 
a continuous  function  of  t,  it  is  nonnegative  for  t = 0:  that  is,  q(l)  is  nonnega- 
tive. By  the  definition  of  p(z),  p(l)  ^ 0,  and  c - -q_  /p  does  not  vanish. 

Tn^S  ID”  s 

Therefore  q(l)  is  positive  and  c is  positive. 

Let  the  expansion  of  b(z)  of  Section  9 be  given  by 

«o 

b(z)  = I b.  z’’  . (10.9) 

j=m-s  ^ 

; It  follows  from  (9.3)  that  on  the  unit  circle 

f 

I b(z}  b{z  = ch(z)  . (10,10) 

I Subctitu  ion  of  (10.9)  gives 

1 

1 ® 


I b b^  = ch  (j  = ...,  -1,  0,  1,  ...)  . (10.11) 

l=m-s  ^ ^ 


for  some  constant  coefficients  . Therefore  the  series  in  the  left  member  of  (10.11) 
is  absolutely  convergent, 

liCt  B = (b.  .)  denote  the  ma-.ruc  of  N - s rows  and  a denumerable  infinity  of 
columns  given  by 


(i  > j) 


b . . (i  < j)  . 
m-s+]-i  — 


(10.12) 


It  follows  from  (10.11)  and  (6.1)  that 


-1  T 

D = cBB 


(10.13) 


The  structure  of  the  right  member  of  (10.13)  shows  that  D is  nonnegative  definite; 
since  it  is  nonsingular,  it  is  positive  definite,  and  consequently  D is  positive  definite, 
as  required. 


Let  i(i(t)  be  defined  by 


Then  (10.8)  is  equivalent  to 


V<(t)  = 1 - ^.(t)  . 


0 < i()(t)  < 2 


(0  < t < 2'n)  . 


(10.14) 


Let  i|)  denote  the  maximum  value  of  il)(t)  , and  let  A = (a.  .)  be  the  square  matrix 
max  T , , , , 

of  order  N defined  by  (9.9).  let  |v|  denote  the  Euclidean  norm  of  a vector  v . 

Lcmi.ia  10.3.  If  v is  any  vector  of  N real  components  and  tKt)  is  iiositive  in 

(0,  2ii)  , 


1av|^/|v|^  1 c ^ 


(10.15) 


Proof.  Let  V be  an  arbitrary  vector  of  N real  components,  with  jth  component 


v^  , and  let 


V(t)  = J V.  e^ 


(10.16) 


be  the  characteristic  function  of  v . Then,  if  a(2)  is  the  polynomial  defined  by  (3.4). 

a(e  ) V(t)  = J «.  e ^ , 

j-1  3 


whore,  for  j ^ ro,  is  the  (j  - m)th  component  of  Av  . Moreover,  it  follows  from 


(3.4),  (9.2),  and  (10,7)  that 


^(t)  = 08(0^*^)  a(e  ^^)  . 


(10.17) 


By  Parseval's  formula  (Schoenberq  1946) 


= 27  K 


(10.18) 


Similarly,  in  view  of  (10.17), 


l^'^l  |v(t)l^  dt  = ^ — / i(i(t)  lv{t)l^  dt 


(10.19) 


From  (10.18)  and  (10.19),  (10.15)  follows. 

-T  ' 

It  IS  easily  verified  that  the  symmetric  matrix  A A is  indentical  with  F = I - G 

except  for  the  elements  of  the  square  subroatrix  in  the  lower  right  corner.  In  fact,  the 
. -T  ' 

elements  in  the  lower  right  corner  of  A A are  such  that  the  entire  matrix  becomes  a 

Toeplitz  matrix  if  the  first  m rows  and  the  first  m columns  are  deleted.  It  follows 

"•T  - 

that  F can  bo  obtained  from  cA  A by  subtracting  a square  matrix  of  order  N whose 
elements  are  all  zero  except  for  the  square  submatrix  of  order  m in  the  lower  right 


In  fact,  if  Aj^  and  A^  are  defined  as  in  the  proof  of  Theorem  9.1,  it  is  easily 

-’T 

verified  that  the  square  submatrix  of  order  m in  the  lower  right  corner  of  cA  A is 


given  by 


T T 

A^  + Aj^  Aj^)  , 


while  the  corresponding  submatrix  of  F is  cAj^  A^^  . If,  therefore,  A is  defined  as 
the  square  matrix  of  order  N having  A^^  in  the  lower  right  corner  and  zeros  everywhere 
else,  we  have 


Lemma  10.4. 


^T  ' — T — 

F«=l-G=c(A  A-A  A) 


Before  stating  the  theorem  that  is  the  main  result  of  this  section,  we  point  out 
(Oldenbergor  1940,  Dresden  1942)  that,  for  a given  matrix  C,  lim  c"  exists  if  and  only 
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if  either  all  eigenvalues  of  C lie  within  the  unit  circle,  or  else  1 is  a siiople  zero 
of  the  minimum  polynomial  of  C and  all  other  eigenvalues  lie  within  the  unit  circle.  As 
multiplication  by  G leaves  unchanged  vectors  whose  components  are  successive  equally 
spaced  ordinates  of  a polynomial  of  degree  s - 1 or  less,  it  is  clear  that  1 is  an 
eigenvalue.  As  G is  symmetric,  all  its  eigenvalues  are  real,  and  all  zeros  of  its  mini- 
mum polynomial  (including  1)  are  simple.  Therefore  the  limit  (10.6)  exists  if  and  only  if 
all  eigenvalues  of  G other  than  1 are  strictly  between  -1  and  1 . 

Theorem  10.1.  The  limit  g“  exists  for  all  N if  and  only  if  the  characteristic 
function  ij(t)  satisfies  the  condition 

-1  _<  i)i(t)  <1  (0  < t < 2ti)  . (10.8) 


Proof.  Let  (10.8)  hold,  and  recall  that  (10.8)  is  equivalent  to  (10.14).  Now  we 

shall  consider  a particular  value  of  N and,  for  convenience,  drop  the  subscript  N . 

The  eigenvalues  of  F are  obtained  by  subtracting  from  1 those  of  G . We  need  to  show, 

therefore,  that  the  nonzero  eigenvalues  of  F are  positive  and  do  not  exceed  2. 

T 

Since  F = K DK  and  D is  positive  definite  by  Lemma  10.2,  F is  nonnegative  def- 
inite. Therefore  its  nonzero  eigenvalues  cire  positive.  Let  v be  an  arbitrary  nonzero 
real  vector  and  consider  the  Rayleigh  quotient. 


r 


T 

V Fv 


T -T  - 
V A Av 

T 

V V 


c 


T tT  - 
V A Av 

T 

V V 


by  Lemma  10.4.  Since  A A is  nonnegative  definite,  we  have 

r < cIavI^/IvI^  < il) 

by  Leiuna  10.3.  Therefore  (10.14)  gives  r ^ 2 . Since  the  spectral  radius  is  the  maximum 
value  of  the  Rayleigh  quotient,  this  completes  the  first  part  of  the  proof. 

We  shall  prove  the  converse  by  showing  that  if  (10.8)  fails,  the  limit  (10.6)  does 
not  exist  for  some  N . There  are  two  ways  in  which  (10.8)  can  fail.  Either  iKt)  may 
be  equal  to  1,  or  [♦(t))  may  exceed  1 . In  the  former  case,  as  was  pointed  out  im- 
mediately following  Lemma  10.1,  G is  not  defined. 


'i 

dl 


( 


I 


i 


1 
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I 


We  consider  therefore  the  case  in  which  [♦(t)]  > 1 for  some  t = x,  and  let  v be 
the  N- vector  whose  jth  component  is 

N + 1. 


= exp[iT(j 


-)]  . 


(10.20) 


Using  an  asteris)c  to  denote  the  conjugate  transpose,  we  have  v^  v^  = 1,  and  therefore 

V V ■=  N « (10,21) 

Except  for  the  first  m and  the  last  in  components  we  have 

(Gv)j  = 4>(t)  , 

and  so 


v Gv  = 4i(t)  (N  - 2m)  + C , 


(10.22) 


where  C denotes  the  contribution  of  the  first  m and  the  last  m components.  Because 
of  the  symmetry  of  both  the  matrix  elements  and  the  vector  components,  C is  real.  Since 
all  the  vector  components  have  absolute  value  1,  an  upper  txiund  to  C is  the  sum  of  the 
absolute  values  of  the  elements  in  the  first  m and  the  last  m rows  of  G . Call  this 
Cj  . We  recall  that  does  not  depend  on  N . 

Now  choose  N sufficiently  large  so  that 

+ 2ml(!)(T)  1 


N > 


UCt) 


Then 


and  it  follows  that 


Consequently, 


NI|*(t)  I - 1]  > + 2m|(KT)  I , 

(N  - 2m)  jiKx)  1 > N + Icl  . 

I (N  - 2m)  iKt)  + C|  > N , 


and  therefore,  by  (10.21)  and  (10.22), 


V Gv 


> 1 


It  follows  that  the  spectral  radius  of  G is  greater  than  unity,  and  the  proof  is  complete. 


i 
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It  is  easily  verified  that  G , when  it  exists,  is  the  orthogonal  projector  on  the 
eigenspace  of  G associated  with  the  eigenvalue  1,  that  is,  the  space  of  N-vectors 
whose  components  are  successive  eq\ially  spaced  ordinates  of  polynomials  of  degree  s - 1 
or  less. 

11.  SMOOTHING  FORMULAS  IN  THE  STRICT  SENSE  AND  AN  OPTIMAL  PROPERTY 

At  cui  early  stage  of  the  investigations  underlying  this  paper  I was  trying  to  explain 
the  natural  extension  of  the  MWA  graduation  to  my  colleague,  I.  J.  Schoenberg,  whose  work 
plays  such  an  important  role  therein,  and  he  asked  me  (I  thought  with  a slight  show  of 
impatience)  "What  does  it  minimize?"  My  answer  was  that  it  doesn't  minimize  anything,  but 
is  just  a natural  way  of  extending  the  MWA  graduation  to  the  ends  of  the  data.  This  was 
too  simplistic  an  answer,  for  we  shall  now  show  that  it  does  in  fact  minimize  "something." 

In  a slightly  more  general  form  of  the  Whittaker  smoothing  method  iGreville  1957)  one 
minimizes  the  sum  of  the  squares  of  the  departures  of  the  smoothed  values  from  the  ob- 
served values  plus  a designated  positive  definite  quadratic  form  in  the  sth  differences 
of  the  smoothed  values.  In  other  words,  one  minimizes 

(u  - y)''  (u  - y)  + (Ku)"^  HKu  , 

where  H is  a given  positive  definite  matrix  of  order  N - m r Minimization  of  this  ex- 
pression leads  to  the  equation 

(I  + k’’  HK)u  = y , 

T 

which  has  a unique  solution  for  u since  I + K HK  is  positive  definite.  I showed 

(Greville  1957)  that  this  graduation  method  has  the  interesting  property  that  if  roughness 

T 

(opposite  of  smoothness)  is  measured  by  the  term  (Ku)  HKu,  smoothness  is  always  in- 
creased by  the  graduation.  By  Theorem  5.22  of  Noble  (1969), 

T -1  T -1  T -1 

(I  + K HK)  »I-K(H  +KK)  K. 

Tlie  last  expression  is  of  the  form  (5.6)  and  suggests  that  the  use  of  an  MWA  with  the 

natural  extension  might  t>c  regarded  as  a generalized  Whittaker  smoothing  process  if 

-1  T -1 
D - (H  + KK  ) 
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Solving  for  H gives 


-1  T -1 

H = (D  - KK  ) . (11.1) 

We  are  led  to  inquire,  therefore,  under  what  conditions  an  MWA  is  such  that  the  right 
member  of  (11.1)  is  positive  definite.  Clearly  H is  positive  definite  if  and  only  if 
the  Toeplitz  matrix 

-1  -1  T 

H ■=  D - KK  (11.2) 

is  positive  definite. 

Schoenberg  (1946,  p.  53)  remarks  that  it  is  desirable  for  an  efficient  smoothing  for- 
mula, one  that  achieves  adequate  smoothness  without  producing  unnecessarily  large  departures 
from  the  observed  values,  to  have  its  characteristic  function  satisfy  the  stronger  con- 
dition 

0 1 ♦(t)  1 1 ' 

This  remark  seems  to  have  been  little  noted  in  the  years  since  its  publication.  We  shall 
call  an  MWA  a smoothing  formula  in  the  strict  sense  if  its  characteristic  function  satisfies 
the  condition 

0 < i|>(t)  <1  (0  < t < 2ii)  , (11.3) 

and  we  shall  show  that  (11.2)  is  positive  definite  for  all  N if  and  only  if  (11.3)  holds. 

-1  T 

Theorem  11,1.  Let  (10.8)  hold.  Then  Q = D - KK  is  positive  definite  for  all 
N if  and  only  if  the  MWA  is  a smoothing  formula  in  the  strict  sense. 

Proof.  Let  (11.3)  hold,  let  v be  an  arbitrary  nonzero  real  N-vector,  and  consider 
the  Kayleigh  quotient, 

r “ = (c|b’’  v|^  - Ik"^  v1^)/1v|^  (11.4) 

V V 

where  B is  given  by  (10.12).  I«t  V(t)  be  defined  by  (10.16),  Then,  by  Parseval's 
formula. 


by  (10.10).  Itoreover,  again  by  Parseval's  formula, 


Ik""  vl' 


= ~ ! (4sin^  k)®  lv(t)  1^  dt  . 
0 


It. 


(11.6) 


It  was  shown  in  the  proof  of  Lemma  10.2  that  q(e  ) is  positive,  and  therefore 
h(e^^)  = [q(e^^)]  ^ is  positive,  for  0 < t < 27r  , By  means  of  (11.5),  (11.6),  and 


(10.7),  (10.4)  gives 


2ii 


= ^ / [h(e^^)  - (4sin^  |v(t)|^  dt 


2v 


= i / h(e^^)  *(t)  |v(t)  1^  dt  > 0 . 


2n 

u 

since  any  zeros  of  ^(t)  constitute  a set  of  measure  zero.  Since  r is  positive  for 
arbitrary  nonzero  v,  Q is  positive  definite. 

To  prove  the  converse  we  shall  supix)se  that  (11.3)  does  not  hold  and  show  that  r is 
negative  for  some  N and  some  v . Because  of  tlie  hypothesis  that  (10.8)  holds,  4'(t)  is 
le.ss  than  1 . We  supix>so,  therefore,  that  iji(t)  < 0 for  some  t - t in  (0,  27r)  . Let 
V be  given  by  (10.20).  By  an  argument  similar  to  that  used  in  the  proof  of  Lemma  10.2, 
it  is  easily  shown  that  the  series  (6.2)  for  h(z)  is  absolutely  convergent  for  z - 1 . 
Thus,  for  any  small  positive  quantity  c,  there  exists  a positive  integer  M such  that 

i i>^ji  . 

j=M+l  ^ 

Thus,  for  N > 2M,  the  jth  comijonent  of  Qv , for  j = M + 1,  M + 2, 

multiplied  by  a quantity  less  than 

V / it,  . ,.-21  . E 

h (e  ) + e - (4Bin  J i l 


By  (10.7). 


ui  i 2 1 .s 

h (e  ) - (4Ein  — t ) 


h(o^^)  ♦(!)  . 


iT. 


It  was  shown  in  the  proof  of  Lemma  10.2  that  (10.8)  implies  h(e  ) > 0 

negative.  Choose  t sufficiently  small  so  that  (11.7)  is  negative. 

* 

As  in  the  proof  of  Theorem  10.1,  we  find  that  v v ^ N,  while 


. , N - M,  is  V 

(11.7) 

(11.8) 

Thus,  (11.8)  is 
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(11.9) 


V Qv  £ (N  - 2M)  Ih(e  ) ((i(t)  + £ ] + C , 

where  C denotes  the  contribution  of  the  first  M and  the  last  M rows.  As  in  the  proof 
of  Theorem  10.1,  it  follows  from  symmetry  that  C is  real.  Let 

s-j  2s 


n = I |h.  - (-1)®'^  (^/.)|  , 


,2s. 


where  ( ^ ) is  understood  to  vanish  for  negative  j or  j > 2s  . Then 

C 1 |cl  1 2Mn  , 

since  all  components  of  v have  absolute  value  unity.  Now  take 

it. 


. 2M[h(e  )♦(!)  + £ - n] 

N > —•  ; “ 

h(e*^)  ((.(T)  + e 


(11.10) 


Note  that  iDOth  numerator  and  denominator  of  the  right  member  of  (11.10)  are  negative. 

From  (11.10)  we  obtain 

(N  - 2M)[h(e^’')  t(T)  + e 1 < -2Mn  , 

* 

and  (11.9)  then  gives  v Qv  < 0,  so  that  Q is  not  positive  definite.  This  completes 
the  proof  of  t)ie  theorem. 

It  is  easy  to  construct  an  MWA  that  is  a smoothing  formula  in  the  strict  sense.  A 
trivial  example  is.the  formula 

“x  = ^ <-yx-2  ^^x-l  ^ ^^^x  ^ '•yx+l  - yxt2>  • 

However,  none  of  the  weighted  averages  in  general  use  fall  in  this  class.  In  particular, 
using  the  properties  of  Jacobi  polynomials,  1 have  shown  elsewhere  (Greville  1966)  that  tlie 
characteristic  functions  of  all  minimum-K^  averages  assume  negative  values  in  (0,  27t)  . 
Tlius  no  such  formula  is  a smoothing  formula  in  the  strict  sense. 

There  is,  however,  one  family  of  moving  averages,  mentioned  in  the  literature  but  not 
in  general  use,  that  arc  smoothing  formulas  in  the  strict  sense.  Elsewhere  (Greville  1966) 
I have  considered  the  limiting  case  of  the  minimum-R^  formulas  as  s tends  to  infinity. 

In  finite-difference  form,  the  minimum-R^  MWA  of  2m  + 1 terms,  exact  for  the  degree 
2s  - 1,  is 
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which  is  nonnegative,  with  a single  zero  of  order  m-s  + l at  t = Ti. 

By  a tour  de  force  it  is  possible  to  show  that,  for  an  MWA  that  is  not  a smoothing 
formula  in  the  strict  sense,  but  whose  characteristic  function  satisfies  (10.8),  the  natural 
extension  docs  nevertheless  "minimize  something."  For  the  given  MWA,  let  -p  denote  the 
minimum  value  of  ♦(!),  and  let  y be  chosen  so  that  0 < y ^ (1  + p)  ^ . Then 
1 " Y(1  + p)  0 . Let  a modified  MWA,  u^  be  obtained  by  taking 

“x  “ (11.11) 


6 - y"^ 

Is  positive  definite.  Thus,  the  total  smoothing  operation  including  the  "tails,"  based  on 
an  MMX  that  is  a smoothing  formula,  but  not  in  the  strict  sense,  does  in  fact  minimize  the 
Using  statistical  terminology,  this  expression  may  therefore  be  re- 
•*  • "loss  function,"  but  in  that  context  is  difficult  to  interpret  and  justify  in 
practical  terms. 
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NUMERICAL  CALCULATION  OF  THE  SOLUTION 
OF  THE  VISCOELASTIC  DEFORMATION  OF  AN  INFINITE 
FLOATING  ICE  PLATE  UNDER  A CIRCULAR  LOAD 


Shunsuke  Takagi 

U.S.  Army  Cold  Regions  Research  and  Engineering  Laboratory 
Hanover,  New  Hampshire  03755 


The  unresolved  problem  submitted  to  the  21st  (1975)  Conference  of 
Army  Mathematicians  (Ref.  1)  was  completely  answered  with  the  aid  of 
Professor  Ben  Noble,  Director,  Mathematics  Research  Center,  University 
of  Wisconsin.  The  theoretical  insight  gained  in  the  above  study 
enabled  me  to  devise  a simple  numerical  integral  method  as  shown  below 


PRINCIPLE  OF  OUR  NUMERICAL  INTEGRATION 


Stated  in  general  terms,  our  problem  is  to  numerically  integrate 
the  integral  containing  a product  of  Bessel  functions, 

I = ♦(8)Ji(6A)Jo(0R)d6,  ( 

where  A and  R are  positive  numbers;  $(6)  is  finite  in  the  range  of 
integration,  and  asymptotically 


4>(6)  ' as 


-n 


(? 


in  which  a is  constant.  The  value  of  n in  our  formulas  was  ^4.  We 
shall  change  (1)  to  a finite  integral.  Let 


[Abs  I]” 


r 

-'N 


|4>(6)Ji(bA)Jo(6R)  |dB. 


Choosing  N so  large  that  4>(6),  Ji(bA),  and  Jo(6R)  may  be  replaced  with 
respective  asymptotic  expressions,  the  upper  bound  of  the  right-hand 
side  can  be  estimated: 


[Abs  I]" 


|«(6)Ji(BA)Jo(BR)|dB 

•'N 


Choose  N to  make 


n"  .^_L_L. 

/U  nc  ’ 

Then  we  can  convert  I to  a finite  integral, 

T'-.  ♦(B)Ji(BA)Jo(BR)dB.  (3) 

We  shall  call  [Abs  I]“  the  absolute  remainder. 

This  method  worked  well,  because  n^4.  If  n is  close  to  zero, 
this  method  does  not  work.  A method  for  the  general  case  is  developed 
in  Appendix  B. 


FIELD  MEASUREMENT 

Frankenstein  [2]  measured  the  deflection  of  a lake  ice.  He 
placed  a tank  on  a frozen  lake,  filled  it  with  water,  and  measured  the 
deflection  with  the  measuring  rods  at  the  distances  as  shown  in  Fig.  1. 
One  of  the  results  of  his  measurement  is  shown  in  Fig.  2. 


Fig.  1.  Field  Measurement 


THEORETICAL  CURVE 

We  assumed  the  lake  ice  to  be  a linear  viscoelastic  material  of 
the  Maxwell -Voight  type  four-element  model,  as  shown  in  Fig.  3.  This 
model  is  known  to  satisfactorily  i^epresent  the  creep  of  ice  [Jellinek 
and  Brill  3]. 


s 


2 


I 


Time  (min) 


Fig.  2.  A result  of  Frankenstein's  measurement  (Ref.  2,  Test  8). 


Leed  (l») 


First  we  solved  the  case  of  step  loading,  i.e.,  under  the  assumption 
that  the  circular  load  P of  radius  a was  applied  at  time  t = 0 and  kept 
constant  for  t>0.  The  process  of  solution  is  shown  in  Appendix  A. 

For  our  curve-fitting  we  need  only  the  deflection  solution, 

Jo  i ,^RF^  ' 


0)  - 


irApi. 


(4) 


in  which  the  functions  of  the  integral  variable  6 are 
DESC  = (t3**+1+t)^  - 4t(3**+E) 


“‘1 
“2  ) 


t3_+ 1+ T+ vBESC 

2(7^) 


The  radius  a of  the  loading  circle  and  the  radial  coordinate  r are 
nondi mens ionali zed  to  A and  R, 

R = 

^ z ’ 

by  use  of  the  characteristic  length 

Eoh 


12p(l-v) 
where  Eq  is  defined  by 

1=1  +i 

Eo  El  E2- 

and  h,  p,  and  v are  the  thickness,  density  and  the  Poisson  ratio, 
respectively,  of  the  ice  plate.  Time  t is  nondimensionalized  to  T 


T = 


Eot 

ni 


Nondimensional  material  constants 

niE  Eo 

T = and  E = — 

naEo  El 

are  used. 
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We  list  in  the  following  the  asymptotic  expansions  of  the  non-Bessel 
factors,  *(6),  contained  in  the  integrands  of  the  deflection  solution  (4), 
and  the  stress  solutions  (A. 37)  and  (A. 38)  in  Appendix  A: 


_4 


02  'V'  ^ 

r(l+6"**) 

e-“iT 

^ '1-TB 

e-“2T 

T-Oj 

_*♦ 

-x.  B -6 

vIlESC 

(ii) 


t-02  8 

-(I-E)b'  . 

v^ESC 


The  actual  loading  was  the  ramp/steady  loading,  as  shown  below: 


Fig.  4.  Definition  of  the  ramp/steady  loading. 


To  calculate  the  deflection  under  the  ramp/steady  loading,  define 
the  influence  function  oao(T)  by  letting  P=1  in  (4), 


u)o(T)  = 

irAp*’ 


4 4 

foy  B (T-aa)  j B (x-ai)  _ j 

[1  + e e ] Ji 

Jo  /DESC’ 


(BA)Jo(BR)dB. 


(7) 


Then,  deflection  a)(T)  for  0£T<Jo  is  given  by 

«>(T)  = r o)o(T-x)PdX 
; 0 

and  for  To<T  by 

('^0 

(o(T)  = a)o(T-X)PdX  , 

Jo 


(8) 


where 


Po 


Substituting  (6)  into  (7)  and  integrating  with  regard  to  x, 
we  get  the  deflection  u)(T)  for  0<T£To: 


j(T)  = — V(Ui-U2+U3), 

irAp 


(9) 


where 


Ui  = ^e““l^-l+aiT)Ji(0A)Jo(BR)d6 


l]^l-e-“i^)Ji(BA)Jo(eR)d6 
Jo  vUESU  “1 


Ua  = r ^-i^^l-e-“2'^)Ji(BA)Jo(BR)dB. 
70  “2 

The  absolute  remainders  are  as  follows: 

[Abs  UilJ  < T^  ^-4 

4iTi7sir 
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Substituting  (6)  into  (8)  and  integrating  with  regard  to  X, 
we  get  the  deflection  u)(T)  for  TqiT: 


w(T)  = — ^-z^Ii+Ia+Ia).  (10) 

itAp^ 


where 

° 0i(eA)j.(6R)da 


l-e"“i'^)^(e“i^o.i)j^(eA)Jo(6R)de 


'OO 

I3  = Jo  ^-ii3^1.e’“i'^)Ji(3A)Jo(BR)dB. 
y^ESC 


The  absolute  remainders  are  as  follows: 


[Abs  IilJI  < (2Tr/^R)“^N“^ 


[Abs  IzIm  < — ^ N"^ 


[Abs 

^ ZitM  Tin 
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CURVE  FITTING 

The  material  constants  found  by  the  curve  fitting  are  shown  in 
Table  1.  They  vary  with  the  location  of  the  measurement. 


TABLE  1 

Tank 

Rod  1 

Rod  2 

Rod  3 

Distance 

1.8m 

4.9m 

9.8m 

19.6m 

X 

2 

6.5 

20 

5 

E 

0.0005 

0.007 

0.05 

0.1 

Eo(kgV) 

1.766x10® 

9.813x10^ 

6.869x10® 

9.813x10^^ 

n/Eo(sec) 

2.815x10® 

4.896x10® 

1.101x10® 

2.448x10® 

TE(ramp)(m)  4.718x10"^ 
TE(flat)(m)  4.727x10"^ 

5.812x10"® 

2.730x10"® 

2.063x10"® 

2.884x10"^ 

2.750x10"® 

Material  constants  found  by  using  the  time-lapse  curves 
of  Frankenstein's  concentrated  load  test  (Ref.  2,  Test  8) 


To  show  the  significance  of  the  material-constant  variation  with 
the  measurement  locations,  we  chose  the  material  constants  determined 
at  Rod  1 of  Frankenstein's  concentrated-load  time-lapse  curve,  and  com- 
puted the  deflections  at  the  other  measurement  locations.  Fig.  5 shows 
the  comparison  of  the  computed  curves  and  the  measured  data.  The  left 
and  right  columns  show  the  ramp  and  steady  portions  of  the  deflection 
curves,  respectively.  They  are  designated  by  (r)  and  (s)  respectively. 
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Timt  (s) 


Fig.  5.  Comparison  of  the  calculated  curves  and  measured  points  of 
Frankenstein's  concentrated  load  test.  Material  constants 
are  determined  by  use  of  the  measurement  at  Rod  1. 

To  express  the  degree  of  curve  fitting  we  devised  the  trapezoidal 
error  (TE).  In  Fig.  6,  let  a,  A and  b,  B be  two  pairs  of  measured  and 
computed  deflections  at  two  consecutive  times  t,  and  t, , respectively. 

H A 4 


Fig.  6.  Elements  of  TE 
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We  squared  A-a  and  B-b  and,  in  case  of  the  left  figure  where  the  errors 
are  of  the  same  sign,  computed  the  area  of  the  trapezoid  whose  bases  are 
{A-a)2  and  {B-b)^  and  the  height  t2-ti.  In  case  of  the  right  figure  where 
the  errors  change  sign,  we  calculated  the  sum  of  the  areas  of  the  two 
triangles  whose  bases  are  (A-a)^  and  (b-B)2  and  heights  to-ti  and  t2-to 
respectively,  where  to  is  the  abscissa  of  the  intersection.  Let 
S be  the  area  thus  computed.  Then  the  TE  is  defined  by 


where  the  summation  is  over  all  the  intervals  and  T the  sum  of  the 
abscissa  intervals. 

The  TE  indicates  a sort  of  absolute  maximum  error.  Its  unit  is  m. 
If  the  deflections  are  of  ordinary  magnitude,  the  TE  of  order  10'^  and 
10"2  means  a good  and  tolerable  fit,  respectively.  If  the  deflections 
are  very  small,  as  in  the  case  of  Rod  4,  the  smallness  of  the  value  of 
TE  does  not  mean  much.  We  did  not  list  the  computed  values  at  Rod  4 
in  Table  1. 


We  evaluated  the  TE  for  all  the  possible  cases.  They  are  shown 
in  Fig.  7.  The  abscissa  is  the  distance  from  the  center  of  the 
load.  The  measurement  locations  are  noted  on  the  abscissa  axis.  The 
circled  points  are  those  whose  material  constants  are  used  to  compute 
a set  of  TE.  The  sets  of  TE  thus  computed  are  connected  with  solid 
lines  and  labeled  with  the  appelations  of  the  circled  measurement 
locations. 


Dt(tonr.«  Im) 


Fig.  7.  The  TE  of  Frankenstein's  distributed  load  test  No.  8, 


We  carried  out  the  curve  fitting  on  the  assumption  of  the  linear 
viscoelasticity,  i.e.,  that  the  material  constants  are  kept  absolutely 
constant  during  the  increase  of  both  the  stress  and  the  deformation. 
However,  Dr.  Andrew  Assur,  an  ice  mechanics  expert,  CRREL,  notified  me 
that  the  real  material  constants  are  nonlinear,  i.e.,  that  they  change 
with  the  stress  and  the  deformation.  He  told  me  that  the  variations 
shown  in  this  paper  are  reasonable  from  the  nonlinear  viewpoint.  At 
the  writing  of  this  paper  we  could  not  complete  examination  from  the 
nonlinear  viewpoint. 


ASYMPTOTIC  DEFLECTION 

We  shall  show  in  the  following  that  only  one  material  constant  is 
contained  in  the  asymptotic  formulas.  Therefore,  curve  fitting  must  be 
performed  for  sufficiently  small  times. 

Referring  to  the  asymptotic  relationships  in  (5).  one  finds  that, 
when  T is  large,  both  the  step- loading  formulation  (6)  and  the  ramp/ 
steady  loading  formulation  (10)  reduce  to 

_ P K / , , \ 


where 


)Ji(6A)Jo(eR)d6. 


(12) 


It  is  considered  in  this  derivation  that  t>b,  and  that  only  large  values 
of  6 are  effective  in  the  integration.  Letting 

X = bA, 

(12)  becomes 

K = (l-e‘'^AX  ^)j^(x)Jo(Jx)dx,  (13) 

(14) 

(15) 


i 


where 
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Thus  all  the  material  constants  are  lumped  into  the  second  factor  of 
(15).  The  stress  formulas,  although  not  mentioned  here,  can  be  simi- 
larly transformed. 


As  shown  in  the  Appendix  B,  (13)  cannot  be  analytically  integrated. 
It  must  be  numerically  integrated.  To  effect  the  numerical  integration, 
we  have  chosen  the  non-Bessel  factor  in  (13)  to  become  zero  at  x = ®. 

The  absolute  remainder  is  estimated: 


[Abs-.  K]“ 


2ir^ 


Graphs  of  integral  K for  the  values  of  R/A  =0.2  and  2.0  are  shown 
In  Fig.  8.  When  T.  = ®,  the  non-Bessel  factor  becomes  equal  to  one. 

At  this  limit,  therefore,  K = 1 when  R < A,  and  K = 0 when  R > A. 

As  shown  in  the  graphs,  this  limit  is  almost  reached  when  > 1000. 


ExocI 

^,,^-'^sympfotlc 


10"  10 

T. 


Fig.  8.  Graphs  of  asymptotic  integral  K in  (6.4), 


Exact  integral  K was  formulated  for  the  ramp/steady  loading,  and 
evaluated  by  use  of  a set  of  constants:  Tq  = 6x10^  sec,  x = 10, 

E = 1/6,  m/Fo"  6.12x10**  sec  = 17  hrs..  Eg  = 7xl0®kg/m2,  and  v = o.5. 
These  constants  give  ^ - 29.31m  and  T^  = t(2.48  x 10‘^day‘i)  . As 
shown  in  Fig.  8,  the  asymptotoc  integral  K is  very  close  to  the  exact 
integral  in  the  range  T.  > 0.1.  The  above  constants  were  the  values  we 
used  at  the  outset  of  tne  numerical  computation  for  the  rough  estimate. 
We  did  not  use  other  sets  of  constants  to  evaluate  the  integral  K. 


APPENDIX  A 


Analytical  solution  for  the  step  loading 


The  Prohlem 


We  shall  consider  the  viscoelastic  ice  plate  floating  on  water 
extending  horizontally  to  infinity.  (Historical  background  of  the 
analytical  study  isT  mentioned  in  Appendix  C. ) We  shall  use  the 
Maxwell- Voigt  type  four-element  model  (Fig.  3)  to  describe  the  visco- 
elastic deformation  of  ice. 

Using  the  notation  of  Fig.  3*  one  can  show  that  this  model  gives 
the  stress-strain  relationship  which  we  show  in  an  operator  form. 


r JL  + 

Hi  3 

at 


1 


E2  + n^a 

at 


]o 


(A.l) 


where  t is  time.  To  extend  the  one- dimensional  relationship  (A.l)  to 
the  three-dimensional  i-elationshlp , we  asstane,  as  explained  by  FlUgge  [U] 
that  c and  o are  deviatorlc , and  relate  them  by 


2Ge 


where  0.  is  the  rigidity  modulus  relative  to  the  three-dimensional  defor- 
mation. Using  (A.l),  2G  is  given  as  an  operator. 


1 

2G 


El  * nii^  E2  + n2|_ 


(A.2) 


at 


at 


The  differential  equation  describing  the  deflection  w of  an 
elastic  plate  on  water  is 


DV**w  + pw  * q » 


(A.3) 


where  V is  the  blharmonic  operator 

V"* 


a2 

( * 
' ax^ 


a?^ 


p the  density  of  water,  q the  load  per  unit  area,  D the  flexural  rigidity 
defined  by 
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D = 2Gh3/[  12(1  - v)  ] , 


(A.l*) 


in  vhich  b is  the  thickness  of  the  ice  plate,  and  v Poisson's  ratio. 
Substitute  2G  from  (A. 2)  into  (A.U);  substitute  D thus  found  into  (A.3)i 
then' one  finds  the  differential  equation  governing  the  viscoelastic 
deformation  of  a floating  ice  plate.  We  shall  show  this  equation  later 
in  the  nondimensional  form. 


We  assume  the  load  q to  be  step  loading  at  t = 0 distributed  vini- 
formly  over  the  circle  of  radius  a vith  the  center  at  origin.  Let  r 
be  the  radial  distance  from  origin.  Then 


q = q^U(t) 

for  0 = r 

= 0 

for  a < r 

(A.5) 


vhere  U(t)  is  the  step  function.  Our  problem  is  axisymmetric , and  the 
biharmonic  operator  reduces  to 


V-  = ( 


3r^ 


+ ^ )2 
r 3r  ' 


We  shall  nondimen sioneilize  our  differential  equation.  We  define 
the  characterisnic  length  I by 

o'*  = E h3/[l2p(l  - v)l  , 

o 


(A. 6) 


where 


I 


(A. 7) 


We  have  chosen  E , rather  than  E2  or  Ep  , to  define  i because 
E is  related  to  the  secondary  creep  [Nevel  5],  which  is  the  main 
interest  in  ovir  field  observation. 


Let  be  defined  by 


Di  = D/Cpl") 


(A. 8) 


i 
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Use  of  (A.1+)  and  (A. 8)  to 


Di  = 2D/E 

o 

Substituting  G from  (A. 2)  into  (A. 9)  yields 


(A.9) 


D,  = l/{ 


(A. 10) 


Choose  nondimensional  time  T, 


T = E^t/m  , 


and  a parameter  t. 


T = riiS2/(Ti2E^) 


Then  (A. 10)  becomes 


(A. 11) 


(A. 12) 


D,  1 ni/na 

■ = • 3^  * TTT->  • 


(A. 13) 


where 


It  is  noted  that 


Eo/Ei  . 


(A. 14) 


0 = E ^ 1 . 


(A. 15) 


Clearing  the  denominator,  (A. 13)  becomes 


Dl  = |^(|^  + t)/{e|;j^  + (1  + t)^  + t} 


(A. 16) 


where  use  is  made  of  the  relation 


Et  + m/n; 


which  one  can  prove  by  use  of  (A. 12),  (A.l4),  and  (A. 7), 


Define  the  nondimens ional  length  R by 


R = r/Jl  . 

Replace  D in  (A. 3)  with  D in  (A. 8)  and  (A. 3)  becomes 

+ w = q/p  , 


(A. 17) 

(A. 18) 


where 


li-12 

R dR'^ 


With  Di  given  by  (A.16),  (A.i8)  is  the  differential  equation  to  be  solved. 


(A. 19) 


(A. 20) 


(A. 21) 


Applying  these  two  transforms,  (A.l8)  becomes 

+ w = q/p  , 


Applying  the  two  transforms  to  q in  (A.5)>  one  gets 


I 


L 


where 


(l/p)5  = (P/{ttAp)i2))(i/3)Ji(ba)  , 

P = Tra^q 


and 


A = a/i. 


Thus  the  transformed  solution  is  given  by 

^ Tikol^  6(1  + DiB*')  • 

Performing  the  Hanhel  inverse. 


Performing  the  Laplace  inverse. 


To  find  L ^(l/(l  + DiS**)),  compute  the  partial  fraction. 


1 B‘*(t  - az)  1 6'*(t  - ai)  ^ 

s 1 + 5^6“*  s /DESC  3+02  /dESC  s + oj 


where  -oj  and  -02  are  the  roots  of  the  quadratic  equation 
(E  + 6‘*)s2  + (tB**  + 1 + t)s  + t = 0 . 

They  are  given  by 


^ tB**  + 1 + T + /PESO 

02^  2(b‘*  + E) 


(A. 23) 

(A. 24) 

(A. 25) 

(A. 26) 

4 

(A. 27) 

(A. 28)  f 
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(A. 29) 


DESC  = (ra**  + 1 + t)2  - 1*7(3'*  + E)  , 


which  transform  to 


(A. 30) 


= [t(3'*  + 1)  - 1]2  + 1*t(1  - E)  . 


(A. 31) 


From  (a. 31),  it  is  clear  that 


DESC  > 0 . 


(A. 32) 


The  roots  aj  and  a.^  are  therefore  aaways  real.  Moreover,  inspection 
of  (a. 29)  and  (A. 30)  shows  that  both  aj  and  02  are  al”ays  positive. 


Thus  one  finds  that 


L-l( 1 ) = 1 + 

1 + Sis'* 


B'*(t  - 0.2)  -a2T  3'*(t  - aj)  -ajT 


(A. 33) 


Substituting  (A. 33)  into  (A. 27),  solution  w is  found: 


{1  + 


3'*(t  - CX2)  -a2T  3'*(t  - aj)  -aiT 


e }Ji(3A)Jo(3R)d3. 


(A.3i+) 


The  radial  and  hoop  stresses  are  given  by 

6D,32w  V 3w, 

“^r  - - ^ F 


6D  , 1 3w  3 . 

^ 

Changing  D to  Dj  by  use  of  (A. 8)  and  r to  nondimens ional  R by  use  of 
(a. 17),  they  become 


r h^  R 3R^  ’ 


and 


6pi2  ^ ,1  aw 


where  Di 

^ is  the  operator  on  T given  by  (A.16).  The  two-sided  Laplace 
transform  yields 


6p^,32 

h^ 


. V 3 


and 


'0 


6p*.2  1 a 32  — 


where  Djw  is  the  Laplace  transform  of  Djw  . 
Using  (a. 26)  one  gets 

5i 


TfApi-  Jo  ^ g „4 


Jl(6A)Jo{6R)d8  . 


The  Laplace  inverse  of  this  is 


Div  = . L-^(- 


Dl 


1 4 Die** 


■)Jl(6A)Jo(BR)dB 


To  find  L ^(Di/(l  + DiB**))  , compute  the  partial  fraction. 


1 


T - ai 


® 1 + DiB**  /DESC  ^ “1  /DESC  ® “2 

Thus  one  finds 


L-1(_A_)  _ - - “1  .-a,T 


-aaT 


1 + DiB**  AiESC 

Thus  the  inverseof  (A. 35)  is 


/DESC 


6p 


'r  irAh^  |^Jl{BA){Jo(BR)  - ^^^i(BR)}  = = B^dB  . 


(t  - oi  )e  - (t  - a2)e 


-a2T 


(A.35) 


(A.36) 


/desc 


(A. 37) 
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The  inverse  of  (A. 36)  is 


6p 

irAh^ 


O 

Jl(6A){vJo(6R)  + ~-^l(eR)} 


(t  - aj) 


^-ajT 


(t  - a2)e 


-a2T 


6^d6 


/DESC 


(A. 38) 


Tabulation  of  a and  Og  becomes  easier  if  linear  combinations  of  (A. 37) 
and  (A. 38)  that  do  not  contain  v are  computed. 


APPENDIX  B 


ANALYTICAL  BACKGROUND  OF  THE  NUMERICAL  INTEGRATION 


Bl.  The  foilwing  theorem  shows  the  condition  under  which  the  integral  (l) 
becomes  either  discontinuous  or  continuous  at  R = A. 

Theorem  1.  The  integral  (l)  is  discontinuous  or  continuous  at  R = A when 
n in  (2)  is  equal  or  larger  than  zero,  respectively. 

Proof.  We  can  rewrite  (l)  to  a one-parameter  integral 


1(a) 


f (x,a)dx 


(E.l) 


by  letting  x = 3A,  i.e.  a = R/A,  where  f(x,o)  is  continuous  with 
regards  to  x and  a.  The  condition  that  l(a)  is  a continuous  function 
of  a is  that  the  integral  (B.l)  converges  uniformly  with  respect  to 
a (c.f.  Titchmarch  [7]  p.  25)-  The  integral  (B.l)  uniformly  converges 
when  n > 0,  but  does  not  when  n = 0. 


B2.  We  shall  consider  in  the  following  the  integral  (l)  whose  non- 
Bessel  factor  ()i(6)  is  finite  in  the  range  of  integration  but  asymptotically 
becomes  zero  on  a more  general  form  rather  than  in  the  specific  form  (2). 

Let  an  asymptotic  expansion  of  ()i(B)  be 

m 

<t>(B)  - Z <l>  (B)  . (B.2) 

n=0 

Rewrite  (l)  to 

m 

I = Iq  + I K , (B.3) 

n=0  " 
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where 


{4,(6)  - r 41  (6)}J'i(6A)J  (6R)dp 
o n=o  ° 


(B.4) 


Kn  = j ♦^(e)Ji(BA)J^(BR)d3. 


(B.5) 


We  choose  such  an  integer  m that  makes  Iq  fast  convergent.  We  choose 
such  a function  4<r,  (6)  that  makes  (B.5)  analytically  integrable.  The 
following  Theorem  is  useful  for  the  choice  of  4>n  (b). 

Theorem  2.  Let  F(2)  be  an  even  function  of  complex  variable  z = x + iy 
that  becomes  zero  at  z = <«>  and  possesses  only  algebraic  singularities 
(pole  or  branch  points)  on  the  upper  half  plane  but  no  poles  on  the  real 
axis.  Then  the  integral 

I F(x)Ji(ax)J  (bx)dx, 

Jo  ° 

where  a and  b are  positive,  transform  to  the  following  contour  integrals. 


= ^ F(o)  + F(z)HP^{az)J^(Bz)dz, 


when  o<b<a 


= -2-J^  F{z)Ji  (az)H^^^(bz)dz, 

J ~00 


(B.6) 


{B.7) 


when  o<a<b 


/+00 

whereU^  means  the  integral  along  the  contour  in  Fig.  9,  where 
radius  e is  infinitesimal,  and  the  cut  is  along  the  negative  real  axis. 


Proof.  The  proof  in  ref.  [1]  must  be  revised  to  the  following: 
Consider  the  contour  integrals 


I(a>b)  = F(z)Hil^(az)J^(b2)dz 


when  o<b<a,  and 


I(a<b)  = F(z)Ji(az)H^l^(bz)dz 

' -»00 


when  o<a<b.  Use  of  the  asymptotic  formulas  show  that  Hj  ^ (az)  (bz) 
and  Ji (az)H(^^ (bz)  are  zero  on  the  infinitely  large  circle  when 
o<b<a  and  o<a<b,  respectively.  Therefore  we  may  consider  only  the 
contour  along  the  real  axis. 

1 


I(a>b)  = Y I F(z)Hi (az) J^(bz)dz 

I(a<b)  = j I F(z)Ji (az)H^^^ (bz)dz. 

J *00 


Divide  the  real  axis  in  three  parts,  -“'-e.-e-e,  and 
e~“>.  Let  z = -X  in  the  region  -oo--e,  and  z = x in  the  region 
Neglecting  the  infinitesimal  terms,  one  gets 

F(z)  = F(o) 


Hi^\az)  = -2i/(Tiaz) 


H^i''(bz)  = [21/ti]  )log(bz/2). 


Then  (c)  and  (d)  become 


1 ( " 

I(a>b)  = 4f(o)  + F(x)J,(ax)J  (bx)dx 

2 Jo  ^ o 


i 
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and 


I r 

I I(a<b)  = F(x)Ji(ax)J  (bx)dx.  (f) 

i •'o  ° 

I 

Eq  (e)  and  (f)  prove  (B.6)  and  (B.7),  respectively. 


B3.  The  need  of  Theorem  2 appears  frequently  In  the  mathematical  study 
of  the  problems  of  floating  Ice  plate  and  also  the  problems  of  elastic 
plate  on  an  elastic  foundation.  A similar  Integral  Including  only  one 
Bessel  function  was  proved  by  Dougal  as  early  as  In  1903  ([8]  p.  138  and 
147). 

When  t = 0,  our  solutions  of  the  viscoelastic  plate  reduces  to  the 

solution  of  the  elastic  plate.  The  elastic  solution  thus  found  Is  com- 
posed of  the  following  Integrals: 


r 


M 

0 

1 1 J (ax)J  (bx)dx 

-'o  ..  1 

° l+x** 

(■“ 

(B.8) 

Mi  = 

Jl(ax)J  (bx)dx 

0 l+x"*  ° 

(B.9) 

M2  = 

|°°  x^  Ji(ax)J  (bx)dx. 

(B.IO) 

l+x** 


M2  “ bel(b)ker '(a)  + ber (b)kei ^(a)  when  b^a 


= bei '’(a)ker(b)  + ber  ^(a)kei(b) , when  a^b 

M and  M2  are  found  by  directly  applying  the  theorem.  Mj  Is  found  by 
differentiating  M with  regard  to  b.  Wyman  [9]  derived  by  Inte- 
grating a concentrated-load  elastic-plate  solution  over  the  loading 
circle. 

The  continuity  of  M and  M2  at  a • b Is  obvious  on  the  strength  of 
Theorem  1.  We  will  show,  however,  a direct  proof  In  the  following. 

We  shall  prove  that 

ber (x)ker '(x)  - bel(x)kel '(x)  + x 

• ber  ^(x)ker  (x)  - bel '‘(x}kel(x)  (B.ll) 


and 


bel(x)ker '(x)  + ber(x)kel'(x)  « bel'(x)ker(x)  + ber '(x)kel(x) . 

(B.12) 

To  prove  this,  note  that 

Wj (x)  “ ber(x)  + 1 bel(x)  (B.13) 

and 

W2(x)  - ber(x)  + 1 kel(x)  (B.14) 

are  the  solutions  of  the  differential  equation 


d^w  + 1^  ^ - Iw  ■ 0. 

dx^  X dx 

This  can  be  proved  by  decomposing  the  equation 

0 


(B.15) 


(d^  + 1^  w + w 

dx^  X dxy 


of  which  (B.13)  and  (B.14)  are  the  solutions. 


618 


One  can  find  that  the  Wronsklan 


j 

( 

I 


I 

I 

i 


R 


(x)  Wj  (x) 

W'(x)  w'(x) 

1 2 

is  equal  to 

-1 

■ -X  ■. 

Thus  one  has  the  identity 

ber(x)  + 1 bel(x)  ker(x}  -+  i kei(x) 

' ber'Cx)  + 1 bei'Cx)  ker' (x)  + i kel '(x) 


of  which  the  real  part  gives  (B.ll)  and  the  imaginary  part  gives  (B.12). 
Theorem  2 can  be  extended  in  many  ways.  Nevel  [10]  found  that 


r i cir 

J _ F(x)dx  • ^ y F(z)togzd2 


(B 


for  an  odd  function  F(z)  that  does  not  have  any  pole  on  the  real  axis 
and  vanishes  at  z • •>>. 


BA . It  is  impossible  to  apply  Theorem  2 to  the  integrals  of  w at  (4), 
Oy  at  (A.37),  and o g at  (A. 38}  by  the  following  reason. 

The  function  exp  (-02!)  has  essential  singularities  at  the  roots  of 
B**  + E « 0, 

because 

tiffl  02  “ 

B**-»  -E 

The  function  exp  (-OjT)  does  not  possess  any  essential  singularities 
because  the  limit  of 


oj  - 2t/{tB'*  + 1 + t + /(tB**  + 1 + t)2  - 4t(B‘*+  E)) 
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.16) 


! 


Theorem  2 does  not  apply  to  Integral  K at  (13)  because  the  point 
X « 0 Is  an  essential  singularity. 


Only  the  other  alternative  for  the  Integration  Is  the  use  of  Barnes' 
Integral  method.  It  consists  In  substituting  the  Integrals 


J (x) 

V 


ri 

^ ..00^ 


r(v+s+i) 


(B.17) 


and 

%(\h-i)tt1  (1)  ^ _±_  r(-v-s)r(-s)(4z)'^^®ds  (b.18) 

V 2v± 

> -c-t»l 

for  Jy(x)  and  H where  c Is  a real  number  satisfying  c>R(v),  z Is 

complex,  and  x Is  real.  One  can  usually  exchange  the  order  of  Integration 

to  carry  out  the  Integration  with  regard  to  x or  z.  Then,  one  can  carry 

out  the  rest  of  the  Integration  In  most  cases  by  use  of  the  theorem  of 
residue.  Only  the  forms  (B.17)  and  (B.18)  serve  this  purpose.  The  other 
Barnes'  representations  of  Jy(x)  and  Hy'^^(x)  do  not  enable  one  to  carry 
out  the  above  two  procedures. 

However,  as  mentioned  by  Watson  ([11]  p.  192),  (B.17)  does  not  hold 

true  for  v » 0,  and  (B.18)  does  not  hold  true  when  v = 0 and  z Is  real. 

In  these  two  cases,  the  Integrands  of  (B.17)  and  (B.18)  become  proportional 
to  s“^  as  s approaches  1“  as  the  llmlte  of  the  Imaginary  axis.  Therefore 
we  cannot  use  Barnes'  Integral  method  to  carry  out  our  Integrals. 
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APPENDIX  C 


Historical  Background  of  the  Analytical  Study 


Since  ancient  times  floating  ice  plates  have  "been  used  to  cross 
rivers  and  lakes.  Recently  traffic  load  has  increased.  Vehicles  have 
become  heavier;  aircraft  landing  and  parking  also  add  to  weights.  In 
these  several  years  oil  companies  started  to  use  ice  plates  as  drilling 
platforms . 

Formulation  of  the  creep  of  a floating  ice  plate  began  after  World 
War  II  with  the  intense  development  of  the  linear  viscoelasticity  theory. 
In  19^7  Golushkevich  (referred  to  by  Kheysin  [l]  ) presented  an  analysis 
assuming  that  ice  behaves  elastically  for  volumetric  deformations  and 
viscoelastically  for  deviatoric  deformations.  Kheysin  [l]  used  a general 
viscoelastic  thin-plate  theory  to  analyze  the  infinitely-wide  floating 
ice  plate.  He  used  the  Maxwell  model  (Fig.  l),  and  considered  only  a con- 
centrated load.  Nevel  [12]  also  used  the  Maxwell  model,  but  considered 
a distributed  load.  He  limited  his  numerical  computation  only  to  the 
center  of  the  load. 

William  L.  Ko,  as  reported  by  Garbaccio  [li»,15],  used  the  Maxwell- 
Voigt  type  four-element  model  (Fig.  l),  which  is  known  to  represent  the 
creep  of  ice  satisfactorily  well  [Jellinek  and  Brill  3].  In  addition 
to  thin-plate  theory,  Ko  used  Reissner's  plate  theory,  which  includes 
the  deformation  due  to  vertical  shear  forces.  Garbaccio  [15] 
numerically  evaluated  Ko's  solution  for  specific  values  of  material 
constants  rather  than  for  nondimensional  parameters.  Garbaccio 's 
numerical  answers  show  a strong  effect  of  the  discontinuity  of  the  load 
distribution  on  the  values  of  deflection.  It  is  reasonable  to  suspect 
that  his  numerical  evaluation  may  contain  some  errors. 

Yakunin  [l6,  17]  has  solved  the  same  problem  as  Ko,  but  Yakunin  used 
only  thin-plate  theory.  Unfortunately,  only  an  abstract  of  Yakiinin's 
work  is  available  to  the  western  researchers. 

Katona  [l8]  and  Vaudrey  and  Katona  [19]  solved  the  same  problem 
with  a finite-element  viscoelastic  computer  program. 

We  solved  this  problem  analytically,  and  also  developed  an  effective 
method  of  numerical  integration.  However,  the  theoretical  curves  did 
not  satisfactorily  fit  the  field-test  curves.  It  is  now  evident  that  a 
large  scale  laboratory  test  eliminating  the  variation  due  to  natural 
conditions  must  be  carried  out  and  the  applicability  of  the  theoretical 
assumptions  must  be  tested. 
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ABSTRACT.  Many  systems  contain  time  delays  that  have  a significant 
effect  on  the  overall  operation  of  the  system.  For  example,  time  delays 
can  result  from  telemetry  between  ground-based  guidance  systems  and 
drones,  and  from  reaction  or  decision  times  of  human  operators  In  radar- 
tracking systems.  Myqulst-type  results  and  functional  analytical  tech- 
niques are  available  for  designing  controllers  for  such  systems,  but  it 
Is  difficult  to  Implement  these  methods  due  to  the  infinite  dimension- 
ality of  the  underlying  vector  spaces.  In  contrast  to  these  approaches, 
the  presentation  given  here  deals  with  an  approach  based  on  the  algebra- 
ic structure  of  the  system  model.  In  this  algebraic  setting,  designs 
can  be  constructed  using  computations  based  on  matrices  and  vectors  de- 
fined over  rings  of  operators. 

1.  INTRODUCTKM.  In  the  last  decade  there  has  been  a good  deal  of 
effort  (see  the  survey  [1])  devoted  to  the  study  of  dynamical  systems 
described  by  functional  differential  equations  In  n-dlmenslonal  space. 
Much  of  this  mathematical  theory  centers  on  the  class  of  linear  systems 
with  time  delays  given  by  a set  of  delay  differential  equations.  Sys- 
tems with  time  delays  have  appeared  In  many  engineering  problems  for  the 
last  several  decades  [2,3].  In  many  of  these  applications  the  delays 
resulted  from  the  flow  of  fluids  or  gases  In  various  types  of  Industrial 
processes  (e.g.  sec  [4]).  In  many  new  applications,  time  delays  result 
from  telemetry  or  comnunlcatlon  links  between  components  of  a system 
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located  large  dlstancea  from  each  other.  For  Instance,  sizable  time  de- 
lays occur  In  the  ground-based  guidance  of  drones  at  White  Sands  Missile 
Range  [53.  Time  delays  can  also  result  from  data  processing  In  the  exe- 
cution of  on-line  control  and  estimation  algorithms. 

Systems  with  time  delays  arise  In  many  military  applications.  For 
example.  In  addition  to  the  drone  control  problem  at  White  Sands,  delays 
can  result  from  reaction  or  decision  times  of  hiuoan  operators  In  radar- 
tracking systems  and  manned-vehicle  systems  [63.  In  ground  warfare,  de- 
lays can  result  from  the  relocation  of  men  or  weapons. 

Despite  recent  efforts  to  develop  an  extensive  mathematical  theory 
for  time-delay  systems.  It  appears  that  few  viable  new  techniques  have 
been  made  available  for  the  applications  mentioned  above.  This  Is  due 
In  part  to  the  "gap"  between  new  mathematical  theories  and  current  engi- 
neering practices.  However,  the  primary  difficulty  Is  due  to  the  fact 
that  time-delay  systems  are  Inflnlte-dlmenslonal  systems.  Thus  the  Im- 
plementation of  mathematical  results  In  general  requires  the  use  of  fl- 
nlte-dlmenslonal  approximations.  Further,  If  equations  are  to  be  Im- 
plemented on  a digital  computer.  It  Is  necessary  to  consider  "discrete- 
time" approximations  with  quantized  magnitudes. 

Although  It  Is  clear  that  approximations  must  be  used  In  the  study 
of  time-delay  systems.  It  Is  not  at  all  clear  as  to  when  approximations 
should  be  brought  Into  the  theory.  For  example,  given  a system  model 
with  time  delays,  one  could  Immediately  approximate  all  delays  by  flnlte- 
dlmenslonal  elements,  and  then  proceed  by  applying  the  theory  of  flnlte- 
dlmenslonal  systems.  On  the  other  hand,  one  could  attempt  to  work  with 
the  time-delay  model  as  much  as  possible,  say  by  using  symbolic  computa- 
tions, and  then  Implement  the  resulting  equations  on  a digital  computer. 
In  general,  we  would  expect  the  latter  procedure  to  yield  a higher  degree 
of  accuracy  since  the  time  delays  are  not  approximated  until  the  last 
step.  In  fact,  as  Illustrated  In  the  next  section,  the  first  procedure 
can  result  In  unstable  designs. 

The  main  purpose  of  this  paper  is  to  show  that  various  problems  In- 
volving time-delay  systems  can  be  studied  via  the  second  procedure  men- 
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tloned  above.  The  theory  Is  based  on  a module  framework  for  linear  func- 
tional differential  equations  [7,8,9D.  Although  this  has  not  yet  been 
attempted.  It  should  be  possible  to  apply  the  results  of  the  module  ap- 
proach to  the  engineering  applications  mentioned  above. 

2.  TRANSFER-FUNCTION  TECHNIQUES.  Classical  techniques  for  the 
study  of  time-delay  systems  are  based  for  the  most  part  on  transfer- 
function  representations.  Although  few  general  results  are  available. 

In  some  cases  It  Is  possible  to  apply  standard  methods  such  as  the  Nyqulst 
stability  test.  To  Illustrate  this,  consider  the  following  control  sys- 
tem: 


In  this  example,  l/(s+l)  Is  the  transfer  function  of  the  open-loop  sys- 
tem (a  low-pass  filter  In  this  case).  The  Input  u(t)  of  the  open- loop 
system  is  given  by 

u(t)  - -Ky(t-a)  + r(t) 

where  R Is  the  gain  of  the  amplifier  and  r(t)  Is  an  external  signal.  The 
a-second  time  delay  In  the  feedback  loop  could  be  a result  of  the  control- 
ler (In  this  case  the  amplifier)  being  located  a large  distance  from  the 
open-loop  system.  The  transfer  function  T(s)  of  the  closed-loop  system 
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Is  given  by 


t 

I 


(1) 


T(8) 


1 

8 + 1 + Ke"®® 


~S8 

where  e Is  the  transfer  function  of  the  time  delay. 

It  is  well  known  that  the  closed-loop  system  Is  exponentially  stable 
If  and  only  If  the  zeros  of  the  characteristic  function  s -f  1 + Ke  are 
In  the  left-half  of  the  complex  plane.  Unfortunately,  In  general  It  Is 
difficult  to  compute  zeros  of  characteristic  functions  containing  exponen- 
tlals  e . However,  In  some  cases.  Including  the  above  exaiiq)le,  stability 
can  be  determined  by  using  the  Nyqulst  encirclement  criterion:  Given  a 
fixed  value  of  a,  the  range  of  positive  values  of  K for  which  the  closed- 
loop  system  Is  stable  can  be  computed  by  plotting  e~®®/(s  + 1)  for  s » J«, 

- * < U)  < “>  . 

As  an  example,  let's  set  a = .1.  Since  In  this  case  the  time  delay  Is 
1/10  th  of  the  time  constant  of  the  open-loop  system,  it  may  appear  that  the 
delay  can  be  neglected.  If  we  neglect  the  .1  second  delay,  we  find  that  the 
closed-loop  system  Is  stable  for  any  positive  value  of  K.  If  we  consider 
the  delay  and  apply  the  Nyqulst  test,  we  find  that  the  closed-loop  system 
is  stable  when  0 < K < 16.3  and  unstable  when  K > 16.3.  Hence  the  delay  has 
a very  significant  effect  on  stability.  This  phenomenon  Is  well  known  In 
engineering,  for  example  see  [10,  pages  346-350]. 

Now  let's  consider  a rational  approximation  to  the  delay  so  that  the 
above  system  can  be  studied  using  results  for  rational  transfer  functions. 
Writing 


(2) 


-as 

e 


1 

1 + as  + (as)^/2  + (as)^/6  + ... 


we  get  a rational  approximation  by  truncating  the  series  In  (2) . Again  let 
a > .1,  and  consider  the  following  first-order  approximation 


(3) 


-.Is  ^ 1 _ 10 

® “l+.ls“8  + 10‘ 
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The  approximation  (3)  may  seem  reasonable  In  this  case  since  the  band- 
width of  10/ (s  -t-  10)  Is  ten  times  the  bandwidth  of  the  open- loop  system 
(given  by  l/(s  + 1)).  Using  (3),  we  have  that  the  transfer  function 

A 

TCs)  of  the  resulting  system  approximation  Is  given  by 

(4)  T(8)  - . 

s + 11s  + 10(K+1) 

From  (A),  we  see  that  the  system  approximation  Is  stable  for  any  R > 0. 
Thus  with  the  first-order  approximation  of  the  delay,  we  completely  miss 
the  fact  that  the  given  system  Is  unstable  for  K > 16.3. 

Now  consider  the  second-order  approximation 

-.Is  1 

® ~ 2 ’ 

1 + .Is  + .005s 

The  transfer  function  of  the  resulting  system  approximation  Is  given  by 

X s^  + 208  + 200 

"(a)  , 2 • 

8"^  + 21s  + 220s  + 200  (K+1) 

Using  the  Routh-Hurwltz  test,  we  find  that  the  system  approximation  Is 
stable  when  0 < K < 22.1  and  unstable  when  K > 22.1.  Thus  with  this 
approximation,  we  do  see  that  the  system  Is  unstable  for  large  values 
of  R,  although  the  critical  value  of  R (22.1)  Is  off  a good  deal  from 
the  actual  critical  value  (16.3). 

These  results  show  that  neglecting  or  approximating  time  delays 
can  lead  to  serious  problems  such  as  Instability.  Moreover,  In  many 
cases  the  order  of  the  rational  approximations  must  be  rather  large,  re- 
sulting In  system  models  that  are  difficult  to  work  with  because  of  the 
high  dimensionality.  Given  these  problems,  there  Is  much  Interest  In 
developing  techniques  (e.  g.  design  procedures)  that  do  not  require  the 
approximation  of  time  delays.  As  will  be  discussed  later,  such  tech- 
niques are  already  available.  They  are  based  primarily  on  state-space 
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models  which  sre  defined  In  the  next  section. 

3.  STATE- SPACE  MODELS.  Again  consider  the  time-delay  system  with 
transfer  function  given  by  (1) . It  follows  from  (1)  that  the  system  has 
an  Internal  (or  state)  representation  given  by  the  following  delay  dif- 
ferential equation 

(5)  “ -y(t)  - Ky(t-a)  + r(t)  . 

As  a generalization  of  (5) , we  can  consider  the  class  of  m- Input 
p-output  time-delay  systems  given  by  the  following  state  model 

(6a)  “ AQx(t)  + Aj^x(t-a)  + Bu(t) 

(6b)  y(t)  “ Cx(t). 

In  (6a, b),  AQ,A^(resp.  B,C)  are  n x n (resp.  n x m,  p x n)  matrices  over 
the  field  R of  real  numbers,  u(t)  c R Is  the  Input  or  control  function, 
y(t)  c R^  Is  the  output  function,  and  x(t)  c R^  Is  the  state. 

As  Is  well  known,  to  solve  (6a)  for  t > 0 we  need  Initial  data  con- 
sisting of  the  values  of  x(t)  for  -a  ^ t ^ 0.  Although  x(t)  Is  usually 
referred  to  as  the  state  at  time  t,  as  a result  of  the  delay  term  In  (6a) 
the  actual  state  at  time  t Is  the  function  segment  x(o),  t - a ^ o ^ t. 
Thus  the  space  of  (actual)  states  for  the  system  (6a, b)  is  some  Inflnlte- 
dlmenslonal  vector  space  of  function  segments.  There  are  several  candi- 
dates for  this  function  space.  Examples  Include  the  space  C([-a,0];R°) 
of  R^’^-valued  continuous  functions  defined  on  the  Interval  [-a,0)  and  the 
space  R*'  x L^([-a,0D;R*^)  where  L*’([-a,0];R^)  is  the  space  of  R°-valued 
p-lntegrable  functions  defined  a.e.  on  [-a,0]. 

Much  of  the  mathematical  theory  of  (6a) , or  generalizations  of  (6a) , 
Is  based  on  a characterization  of  (6a)  In  terms  of  an  ordinary  differen- 
tial equation  In  a Banach  or  Hilbert  space  such  as  C([-e,0] ;R°) . For  de- 
tails, see  the  book  [11]  by  Hale  and  the  papers  (e.g.  [12])  of  Delfour 
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and  Mltter.  However,  this  approach  does  not  fully  exploit  the  flnlteness 
of  (6a)  resulting  from  the  assumption  here  that  x(t)  belongs  to  n-dlmen- 
slonal  space.  This  flnlteness  can  be  retained  In  an  operator  setting  by 
expressing  (6a)  as  a vector  differential  equation  with  operator  coeffi- 
cients. The  constructions  are  as  follows, 
loc 

Let  denote  the  space  of  real-valued  Lebesgue-measurable  locally- 

Integrable  functions  defined  a.e.  on  R with  supports  bounded  on  the  left. 
Let  d denote  the  delay  operator  given  by 

drL^®*^  -V  L^°‘^:f(t)  f(t-a) 
where  a Is  some  fixed  positive  number. 

With  (L^°*^)°  * space  of  n-element  column  vectors  over  L^°^,  we  can  extend 
the  delay  operator  d to  (L^®*^)*'  by  defining 

d:  (f,  (t) f (t))^  H-  (f  (t-a) f (t-a))^ 

X n 1 n 

where  T denotes  the  transpose  operation. 

Then  viewing  x(t)  as  an  element  of  (L^°*^)°,  we  can  write  (6a)  In  the 

form 

where  + A^^d  Is  a n x n matrix  operator  acting  on  elements  In  (1^°*^)°. 

As  a generalization  of  (7) , we  can  consider  vector  differential 
equations  with  coefficient  matrices  defined  over  a ring  of  delay  operators: 
Let  R[d]  denote  the  set  of  all  finite  sums  of  the  form  ^a^d^  where  a^cR 
and  d^:f(t)  **■  f(t-la).  With  the  usual  addition  and  mulllpllcatlon  opera- 
tions, R[d]  Is  a ring  of  delay  operators  and  with  the  scalar  multiplication 

RCd]x(L^°‘^)°  -*•  (L^°‘^)'^:(^a  d^,x(t))  h-  ^a  x(t-la)  , 

1 ^ 1 1 

(L^°^)°  la  a module  over  the  ring  R[d]. 
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Now  given  matrices  F(d),  G(d),  and  H(d)  defined  over  R[d],  consider 
the  following  system  equations 

(8a)  - F(d)x(t)  + G(d)u(t) 

(8b)  y(t)  -H(d)x(t). 

By  definition  of  F(d),  G(d),  H(d),  we  can  write 

F(d)  - jF.d^,  G(d)  - jG.d^,  H(d)  - d^ 

1 1 1 

where  the  matrices  over  R.  Thus  (8a, b)  can  be  written  In  the 

form 

(9a)  “ jF^x(t-la)  + [Gj^u(t-la) 

Ob)  y(t)  - Jh  x(t-la). 

1 

Therefore,  the  class  of  delay  differential  equations  given  by  (9a, b)  can  be 
studied  In  terms  of  vector  differential  equations  (8a, b)  defined  over  a 
ring  of  operators.  This  observation,  along  with  results  based  on  the  oper- 
ator ring  structure,  was  first  made  In  [7].  It  Is  now  known  [8,9]  that  the 
operator  ring  framework  applies  to  a very  large  class  of  time-delay  systems. 
Including  systems  with  noncommensurate  delays  and  distributed  delays. 

U.  APPLICATIONS  OF  THE  OPERATOR  STRUCTURE.  In  this  section  we  briefly 
consider  several  topics  that  can  be  studied  via  the  operator  ring  framework. 
We  restrict  our  attention  to  systems  over  RCd]  given  by  (8a, b),  although 
many  of  the  results  discussed  below  apply  to  more  general  rings  of  operators. 

a.  Computation  of  Solutions.  As  shown  In  [8],  Initial  data  for  solving 
(8a)  can  be  Incorporated  Into  the  operator  structure,  so  that  complete  solu- 


632 


and  the  coefficient  matrices  F(d),  G(d)  of  the  system  equation  (8a).  In 
other  words,  solutions  can  be  computed  by  employing  matrix  operations  de- 
fined over  rings  of  operators.  It  should  be  possible  to  Implement  por- 
tions of  this  operational  calculus  by  using  symbolic  computations. 

b.  Realization.  Given  a system  specified  by  Its  transfer  function 
matrix  T(s),  the  problem  of  realization  Is  concerned  with  the  construction 
of  a state  model  from  TCs).  For  systems  with  time  delays,  T(s)  Is  often 

a matrix  of  rational  functions  In  s and  e , In  which  case  we  want  to  con- 
struct a state  model  of  the  form  (8a, b). 

The  computation  of  a realization  from  T(s)  is  not  difficult;  however, 
the  computation  of  a realization  with  the  number  of  coordinates  of  x(t) 
minimal  among  all  possible  realizations  Is  a nontrivial  problem  In  the 
multiterminal  case.  Such  realizations  are  said  to  be  minimal.  In  [7)  a 
constructive  procedure  Is  given  for  computing  minimal  realizations  defined 
over  RCd].  The  construction  Is  based  on  a method  for  computing  a basis  of 
module  R°[d]  from  a set  of  generators.  These  results  can  also  be  used  to 
reduce  overdetermined  systems  of  delay  differential  equations. 

c.  System  Properties.  As  before,  consider  a time-delay  system  given 
by  the  state  equation 

(10)  - F(d)x(t)  + G(d)u(t) 

where  F(d)  (resp.  G(d))  Is  a n x n (n  x m)  matrix  over  RCd].  Let  N(d) 
denote  the  n x mn  matrix  over  R[d]  given  by 

N(d)  - [G(d),F(d)G(d) F"‘^(d)G(d)]  . 

In  the  special  case  when  F(d)  and  G(d)  are  over  R,  so  that  the  system  (10) 
Is  finite  dimensional,  the  matrix  N(d)  Is  called  the  controllability  matrix 
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of  the  system.  This  term  arises  from  the  well-known  result  [13]  that  a 
flnlte-dlmenslonal  system  Is  reachable  (or  controllable)  If  and  only  If 
the  rank  of  N(d)  Is  equal  to  n (In  this  special  case  N(d)  Is  over  the 
field  R) . Motivated  by  this  result,  we  give  the  following 

Definition.  The  system  (10)  Is  reachable  In  the  strong  sense  (reap, 
weak  sense)  If  the  columns  of  N(d)  generate  the  module  R°[d]  (resp.  N(d) 
has  rank  n viewed  as  a matrix  over  the  quotient  field  of  R[d]). 

This  definition  was  first  given  by  Morse  [14].  Although  strong  and 
weak  reachability  are  not  equivalent  to  the  dynamical  properties  of 
Euclidean  and  functional  controllability,  they  (or  variants  of  these  con- 
cepts) are  related  to  certain  dynamical  properties  such  as  stabillzablllty 
(see  [15]).  The  concepts  of  strong  and  weak  reachability  are  particularly 
Interesting  because  It  Is  possible  to  determine.  In  an  algorithmic  fashion, 
whether  or  not  a given  system  has  these  properties. 

d.  State-Feedback  Control.  For  the  system  (10)  we  can  consider  state 
feedback  by  setting 

u(t)  ■ -K(d)x(t)  + r(t) 

where  K(d)  Is  a m x n matrix  over  R[d]  and  r(t)  Is  an  external  Input.  Note 
that  since  IC(d)  Is  over  the  ring  R[d],  we  are  allowing  time  delays  In  the 
feedback  loop.  This  Is  reasonable  since  the  given  system  (10)  Is  defined 
over  R[d].  However,  the  constraint  that  R(d)  be  over  R[d]  Is  too  severe. 

In  addition  to  "pure"  delays  given  by  elements  of  R[d],  we  need  to  consider 
distributed  delays.  That  la,  let 

(11)  u(t)  - -K(d)x(t)  - L(t-a)x(a)do  + r(t) 

t-h 

where  L(t)  Is  a m x n matrix  of  Integrable  functions  with  bounded  supports 
contained  In  [0,h],  h > 0.  The  last  term  on  the  right  side  of  (11)  Is  re- 
ferred to  as  a distributed  delay  operation.  As  an  example,  suppose  that 
L(t)  Is  a scalar  function  equal  to  1 for  0 t 5 h and  equal  to  0 otherwise. 


Then 


t t 

- o)x(o)do  - /^_j^x(o)do 


In  this  case  the  distributed  delay  operation  Is  the  finite-interval  Inte- 
grator. It  Is  Interesting  that  this  particular  operation  often  appears  In 
the  control  theory  of  time-delay  systems. 

For  feedback  controls  of  the  form  (11) , or  variants  of  this  form,  al- 
gebraic procedures  have  been  developed  for  eigenvalue  (or  pole)  assignment 
and  stabilizablllty  (see  [14,15,9]).  These  results  rely  heavily  on  the  al- 
gebraic properties  of  the  operator  ring  framework. 

A very  Interesting  and  Important  problem  Is  the  computation  of  feed- 
back controls  of  the  form  (11)  that  minimize  the  following  cost  functional 
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J ••  / [x^(t)Qx(t)  + u^(t)Pu(t)]dt 
0 

where  F and  Q are  symmetric  positive  definite  matrices  over  R.  It  Is  known 
Cl]  that  optimal  controls  can  be  computed  by  solving  a set  of  coupled  ordi- 
nary and  partial  differential  equations,  referred  to  as  the  Rlccatl  equations. 
But  there  Is  some  evidence  that  optimal  controls  can  be  expressed  In  terms 
of  the  operator  ring  structure,  which  could  result  in  simplified  procedures 
for  computing  controls.  This  problem  Is  currently  under  Investigation. 

e.  Duality  and  Observer  Theory.  Another  nice  consequence  of  the  oper- 
ator framework  Is  that  there  Is  a natural  concept  of  dual  system  arising  from 
notion  of  the  dual  of  a homomorphism  on  finite  free  modules.  The  dual  system 
Is  defined  as  follows. 

Definition.  Given  the  system 
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c R°,  y(t)  e R**  and  F(d),  G(d),  H(d)  are  n x n, 
over  R[d],  the  dual  aystem  la  given  by 

- F^(d)C(t)  + H’^(d)v(t) 

- G’'(d)t(t) 

c R°,  Y(t}  c r"*  and  T denotes  matrix  transposition, 
duality  theory  for  flnlte-dlmenslonal  systems,  the 
state-feedback  control  problem  for  the  dual  system  corresponds  to  the 
state-observation  problem  for  the  given  system.  Hence  results  on  state- 
feedback  control  In  the  dual  system  yield  results  on  state  observers  for 
the  given  system. 
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The  Structure  of  Groups  with  Index-3  Subgroups 

L.V.  Meisel,  D.M.  Gray,  and  E.  Brown 
Physical  Science  Division,  Watervliet  Arsenal, 
Watervliet,  New  York  12189 

For  any  group  G^  which  contains  an  index-3 
subgroup  G,  it  is  shown  that  either:  (a)  G is  an 
invariant  subgroup  or  (b)  G contains  an  index-2  subgroup 
where  G^  is  an  invariant  subgroup  of  G^.  For  case 
(a),  G and  its  cosets  give  rise  to  three  operators  which 
span  a stable  3-dimensional  subspace  of  the  group  algebra 
which  further  reduces  to  three  1-dimensional  stable 
subspaces.  For  case  (b),  G^  and  its  cosets  give  rise  to 
six  operators  which  span  a 6'dimensional  stable  subspace 
of  the  group  algebra  which  reduces  to  two  1-dimensional 
and  two  2-dimensional  irreducible  stable  subspaces  of 
the  group  algebra.  The  irreducible  representations  and 
the  corresponding  basis  elements  of  the  group  algebra 


are  given  for  both  cases. 


I.  INTRODUCTION 

In  this  paper  we  discuss  some  features  of  groups  containing 
a subgroup  consisting  of  one  third  of  the  group  elements.  We 
derive  the  structure  of  such  groups  and  give  explicit  irreducible 
representations  which  are  characteristic  of  them. 

Interest  in  such  groups  springs  from  their  relevance  to  the 
theory  of  second  order  phase  transitions.  In  their  classic  text 
on  Statistical  Physics,  Landau  and  Lifshitz^  make  the  statement: 

"It  appears  that  the  following  theorem  is  also  true:  No  second- 
order  phase  transition  can  exist  for  transitions  involving  the 
decrease  by  a factor  three  of  the  number  of  symmetry  elements 
(owing  to  the  existence  of  third-order  terms  in  the  expansion  of 
the  thermodynamic  potential)."  In  a recent  review  article 
Cracknell  conjectured  that  this  theorem  probably  could  not  be 
proven  in  the  general  case. 

Thus,  we  were  motivated  to  a general  study  of  such  groups 
and  to  a proof  of  Landau's  Theorem.  The  general  study  is  presented 
here;  the  proof  of  Landau's  Theorem  will  appear  separately.^ 

II.  STRUCTURE  AND  REPRESENTATIONS 

We  shall  denote  sets  of  group  elements  by  capital  letters  and 
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members  of  sets  by  corresponding  lower  case  letters,  e.g.  hcH  or 

hy^cH^.  The  order  of  any  such  set  will  be  denoted  as  an  absolute 

value,  e.g.^the  order  of  set  H is  denoted  |]l|.  We  shall  be 

discussing  a group  Gq  such  that  |Gq|=3N  containing  a subgroup  G 

with  1g|=N  and  the  distinct  left  cosets  H=hiG  and  K^kj^G.  In  an 

obvious  notation  G_=G+U+K. 

j o 

We  present  theorems  in  Appendix  I indicating  that  either  G is 
an  invariant  subgroup  of  G^  or  that  it  contains  a subgroup  G^^,  an 
invariant  subgroup  of  Gq,  where  |Gy^|=|G]/2.  In  the  latter  case  the 
cosets  H and  K divide  into  the  cosets  (Hy^  and  Hg)  and  (Ky^  and  Kg) 
with  respect  to  Gy^  and  the  remaining  eleipents  of  G are  members  of 
the  coset  Gg.  Table  I gives  a multiplication  table  for  the  various 
sets  of  elements;  its  derivation  is  described  in  Appendix  I.  The 
sets  Gy^,  Gg,  H^,  Hg,  Ky^  and  Kg  form  the  factor  group  G^/Gy^.  In  the 
event  that  G is  an  invariant  subgroup  G,  H,  and  K form  the  factor 


group  G^/G;  the  multiplication  table  for  these  sets  is  the  same  as 
that  in  Table  I for  Gy^,  Hy^,  and  Ky^. 

V.’e  may  furthermore  define  tlie  set  operators  ^ Sy^» 

Sa'^a 

^ S . Ma=I^aI  ^ ^ These  operators  will  have 

the  multiplication  table  given  in  Table  I where  we  now  read  each 


entry  in  the  table  as  the  corresponding  set  operator.  The  operators 


6A1 


sj'an  .1  6_-J linciisioiial  roiluciblc  stable  subspacc  of  tlic  j;roup 
alj^ebra'^  and  also  form  a six  clt'mciit  group.  The  six  cloinont 
group  contains  tlirce  classes;  G^,  (H^,  K^)  -and  (G^,  Kg). 

■flius,  one  can  find  two  1-d imcnsional  and  one  2-diinensional 
irreducible  represcjitation  of  the  six  element  group.  This 
corresponds  to  a reduction  of  tlie  6-dimcnsional  subspacc  of  the 
group  algebra  into  tv;o  1-dimcnsional  stable  subspaces  and  two 
2-dimonsional  irreducible  stable  subspaccs.  (Each  2-dimcnsional 
subspace  gives  rise  to  the  same  irreducible  representation.) 

'V  -1 

V.lien  G is  an  invariant  subgroup,  the  operators  G HjG|  2 g, 

^ gcG 

h,  aiid  K ;|g|  ^ i k form  a three  elcii'cnt  group  each 
hcH  ^ kcK 

cK-  ent  of  which  is  in  a class  by  itself  yielding  three  1 d i , lens  ioiial 
stable  subspaccs  of  the  group  algebra  and  three  1 -d.-'V.^nS'ional 
r. presentations. 

The  characters  of  the  representations  are  given  in  Table  II. 


It  is  straight  rorv.ard  to  show  that  wl-,en  G is  not  invariant  the 
i r .'.hic  ible  stable  s;;h,,.'aces  will  be  spanned  by  tliC  following  . ebers 


group 

algebra: 

1+  • 

1-  • 

'U  'V  w /V 

A_^G^+H,+K 
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Kl  = /l/G  (2n^-lU-Kj 

>\,  — 

= /1/2  (H^-KJ 

<V#  I ^ I ■ 'V/  'Xrf 

Ej  = /r/2  (H_-K_) 

0. 1 ^ '\j  '\l 

=-/l/6  (2G 

'V  0/  'V  ^ 'V  'V 


'V  'V  'w  'u  'w  'V  'V  'V 

where  G^=G  *G„,  H iH„,  and  K K ^K. 

+ AB’+aB'  +AE 


I 

The  normalization  factors  in  the  basis  eloments  of  r2  and  r2 


have  been  chosen  to 

produce  the 

following 

unitary  i'* 

jpreson' at i on 

-v  ri  o\ 

ij 

i 

D(H  )i 

A 

t:  b 

DCy. 

C b 

■v  /l  0\ 

{ 

-a  b \ 

/ -a  -b\ 

-i) 

B(Hp.  ( 

a) 

Dcya 

U J 

where  a = 1/2  and  b 

- /T/2  . 

It  is  interesting  to  note  how  the  2 dii.ionsional  representation 


'X/  ^ ^ 

reduces  when  G is  an  invariant  subgroup.  Here  G^-G^-G_,  etc.  and 


the  complex  conjugate  pair  of  basis  eK  onts: 

<0  'V  — I 'V  "v  *''' 

E,+iE-  = /2/3  (G  +u  H +10  K ) 

1 ^ A A ^ 


^nd  Ej-iE^  = (G^+o)  H^  + u K^) 


where  u = exp  (2tii/3)  are  seen  to  span  l-dimensional  stable  subspaccs 

* 

of  the  group  algebra  with  characters  given  by  P and  P of  Table  Ila. 
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(Tlie  third  stable  subspnee  corro^jioixl  i riy  to  the  i ijn  iit  at  ion  P 
.will  be  spanned  by  A =G^*-)I^  + Ky^ . ) In  this  case  Hj  .md 

basis  elements  for  the  appropri.tte  real  pseudo -2-d imens ional 
irreducible  representation  obtained  upon  invoking  tine  reversal 
invariance, 

A simple  application  of  tliesc  ideas  to  tlie  tlieory  of  continuous 
phase  transitions  is  presented  in  Appendix  II. 
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T.ihlc  I.  Mill t ipl ic.'it  ion  tabic  for  llio  set  operators.  The  table  yives 
the  i)ro'liict  operation  of  an  elonent  (set)  on  the  left  and  an  clement  (set) 


at  the  lop,  c.g. 


Table  Ila.  Character  table  for  the  irreducible  representations  of  the  set 
operator  group  for  the  case  that  |GgI=0..  (a)=exp  (2iii/3)) 


I 

i 


I 


Table  lib,  Cl'.aracter  table  for  the  irreducible  representations  of  the  set 


APPIINOIX  I.  •illl;OKl>IS 


All  of  the  theorems  prcsejitcd  Iicre  pertain  to  the  group 
which  contains  the  subgroup  G and  the  distinct  left  cosets  H and 
K as  defined  in  the  main  text.  IVe  give  proofs  only  for  a select 
few  of  the  theorems;  the  omitted  proofs  are  similar  to  those 
given.  (N.B.  In  these  theorems  tlie  subscripts  i,  j denote 
specific  group  elements.) 

Theorem  1.  If  g^hj  e H then  and  g^K=K.  (N.B.  Not 

every  r.e.ril'er  of  G will  satisfy  the  hypothesis  of  tlieorem  1 unless 
G is  an  ijivariant  subgroup.) 

Proof:  g^H=g^hjG-hgG-H.  'Hie  first  and  tliird  equalities 
follow  from  the  definition  of  left  cosets.  The  second  equality 
follows  from  the  hypothesis  of  the  theorem.  gj^G-G  since  G is  a 
group,  riius  g^K=K  by  the  rear  ranger. ent  theorem^.  QED 

Tlic-orem  2.  If  gjhj  c K then  g^H-K  and  g^K=H. 

Define  G.  and  G as  the  sets  of  elements  g-  satisfying  theorems 
1 and  2 respectively. 

Theorem  3.  G=G.+G  . 

A o 

Theorem  4.  If  h^hj  e K tlien  h^li-K  .and  h^K=G. 

Tiic-orem  5.  If  h^hj  c G then  hpK-G  and  h^K=K. 

Define  and  Hg  as  the  sets  of  elements  h^  satisfying  the 
Iiypotiicscs  of  theorems  4 and  5 respectively. 


Tlieoi-cin  6.  IIH  +H  . 

A B 

Theorem  7.  If  k.k.  e H tlien  k.K-H  and  kJJ=G. 

1 j 1 1 

llieorem  8.  If  k.k.  e G tlien  k . K=G  and  k.lI-H. 

^ J ^ ^ 

Define  K and  K as  the  sets  of  elements  k satisfying 
A B 1 

the  liypotheses  of  theorems  7 and  8 ^respectively . 

Theorem  9.  K-K.+K  . 

A B 

Theorem  10.  The  multiplication  table  for  the  sets  defined 
by  tlieorems  1 to  9 is  given  by  Table  I. 

Theorem  10  can  be  proven  as  follows: 


The  sets  G. , G , H , etc,  have  been  defined  in  terms  of  the 

A'  B A 

result  of  operating  from  the  left  on  arbitrary  eleinents  of  the  sets 

fg  1 r ill 


G,  H,  and  K.  If  h. . is  an  elcsnent  of  H then  h II  I-  IK  etc. 

Ai  A Ai  [cj 

Using  these  properties  we  can  find  the  set  membership  of  any  prcduct , 


For  exa^iiple. 


Ai  Bj  Ik 


H = h.  Ill  = K = g. 


i.e.,  the  product  of  an  arbitca.ry  cles'i-nt  of  H and  an  arbitrary 

A 

eleirent  of  (in  the  order  given)  yields  an  element  of  Gg.  By 
repeated  application  of  these  tec’.nicpies  the  entire  multiplication 
table  is  generated. 


Theorem  11.  Either  |G  |=1g|  or  |G  |=|G|/2. 

A A 

Fioof:  Either  |Gg|=0  or  jGjjlf  0. 


If  |C|,|  .0  tlica 

If  iGgltO  then  gg^  = g^  e G^  for  all  j.  (Table  I), 
licnce,  |G.  |-|G  |.  ('Hie  rearrangement  theorem  implies  that 

A D 

no  element  of  will  be  repeated  as  gg^  luns  over  G^.)  Also 
^Bs  ^Aj  " ^Bm  ^ 

Thei'efore  |Gg|  = |G^|  and,  since  G=G^+Gg  (from  theorem  3), 


|g^|=1g|/2. 


QED 


Theorem  12,  |G. |=lH  1=IkJ  . 

A A A 

Tlioorem  13.  G is  an  invariant  subsu-oup, 

A 

Left  and  right  f'oscts  with  respect  to  can  be  seen  to  be 
identical  by  cxPiinniiion  of  Table  I. 

^ ^ 'V  ^ ^ 'V 

Dicorem  14.  Hie  ojicrators  G ,G^,H  ,K  . and  have  the 

ABABA  B 

i"ul  t ipl  i cat  ion  table  of  Table  I, 


The  proof  of  this  theorem  goes  as  follcv.-s:  Consider,  for 

% f\,  iv  % 2 

exaJi  ple,  the  operators  H.  and  K . H,  K = G h k 

^ ^ A B A B ‘ a'  3,s  Aj  Bs 

I ^ ^ 8n  • last  equality  follows  from  Table  I as 


|G 


A'  j j t 


Bt 


ajiplied  to  sets  of  group  operators  G , G , H etc.  (theorem  10), 

ABA 

the  rearrangement  tliceirc-m  "and  theorejns  11  and  12.  Thus, 

"v  ] 'V, 

H K = G g 3 =G  . The  entire  table  for  eierators  now  can  be 
A B ' a'  j B B . . . 

generated  in  this  v.ay. 

II  'V  -v  "V 

Tlieorem  15.  If  iGgl  =0,  then  G^,  H^,  and  form  a three  elc.r.ent 

group  with  each  cl.  .nt  in  a class  by  itself. 

Tlieorem  16.  If  !G„!^0,  then  G.  ,G  ,H  ,H  ,K  , and  K form  a six 
' B't  a’  b a b a B 

•V  ‘V  'V 

clement  group  with  classes:  G^,  (H^,K^),  and  f^g'^g»*^g^' 
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APPEiNDIX  II.  A SIMPLE  APPLICATION 


An  example  of  the  application  of  these  ideas  is  found  in 
the  theory  of  continuous  phase  transitions,  hhen  a solid 
transforms  from  a structure  of  3N  symmetry  operations  to  one 
of  N operations  one  is  interested  in  a function  invariant  with 
respect  to  the  smaller  group  of  symmetry  operators  which  may 
also  serve  as  a basis  function  for  an  irreducible  representation 
(other  than  the  identical  representation)  of  the  larger  group. 

One  sees  that 

'V#  'X#  'V 

= Ej  (I)  = /1/6  (2  -K  )<t' 

'''I 

(or  4’  -Ej  <f  ) will  be  an  acceptable  function  by  exainination  of  the 
matrix  representation  of  r2  (or  r2^  • particular  case  of 

a transition  from  a crystal  having  the  s)TTiri,eLry  of  the  full  cubic 
group  to  one  having  the  symiiietry  of  the  tetragonal  group  w])ich 
singles  out  the  z-axis  one  finds  by  taking 
t 

'I'  =■  /1/6  ^ (2z2  - x2  - y2) 
which  has  the  partner  function 

4>'  = ApP  (?i+  - K^)  P =.'*^72'  (x2  - y2) 

i.e.,  the  representation  corresponds  to  the  representation  of 
the  full  cubic  group  denoted  in  BSW^  notation.  An  extensive 

discussion  of  the  transformation  from  cubic  to  tetragonal  syimetry 
may  be  found  in  Perel,  Batterman  and  Blount^. 
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0.  N.  Cray]  Thera  Is  a definita  connection  between  this  and  one  aspect 
>f  our  work  on  superconductivity.  The  A-lS's  (Cr^sL  structure)  are  an 
laportant  class  of  hlgh-T^  superconductors;  many  of  the  A-15's  undergo 
a cubic  to  tetragonal  phase  transition  at  temperatures  somewhat  above 
T^.  If  ono  considers  only  the  point  group  of  the  lattice  this  is  a 
factor  of  three  reduction.  Thus,  when  such  transition  is  second-order, 
there  must  be  a further  lowering  of  the  symmetry  (by  relative  movement 
of  the  basis'  atoms)  to  be  consistent  with  Landau's  theory.  Symmetry 
considerations  can  be  used  to  limit  the  types  of  internal  movement. 

See  Perel,  Batteruan,  and  Blount,  Phys.  Rev.  166,  616  (1968). 
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